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INCREASED PRODUCTION OF SECRETES) 
PROTEINS BY RECOMBINAnTEUKABYOTIC CELLS 

Fie Id M 7 the Invention 

This invention relates to cells which have been genetically manipulated to have an 
elevated unfolded protein response {UPR} resulting in an increased capacity to produce 
secreted proteins. 

The secretory pathway of eukaryotic organisms Is of interest sine© cells cars be 
engineered to secrete a particular protein of Interest The secretory pathway starts by 
translocation of the protein into the lumen of the endoplasmic reticulum {ER}. In the ER the 
proteins fold info their final three-dimensional conformation and the core part of the 
gfycans are attached to them, A ■quality control mechanism Involving the proteins cainexin 
and cafreticuSin also resides in the ER, letting only completely folded proteins continue on the 
secretory pathway to the next compartment (Hammond and Helenius, 1 995. Purr. Opinion 
pell Biol, 7:523-529), Secretory proteins that do not fold property are transported hack to the 
cytoplasm fay the translocation machinery and are degraded by the proteasorne system 
(Wierte et a|„ 1996, Nature 384:432-438), 

The folding and glycosyiatioo of the secretory proteins in the ER is assisted by 
numerous ER-resident proteins. The chaperones like 8ip (GRP78), GRPS4 or yeast Lhsl p 
help the secretory protein to fold by binding to exposed hydrophobic regions in the unfolded 
states and preventing unfavourable interactions (Blond-Eiguindi eta!., 1993, Ceil 75:717- 
728). The chaperones are also important for the translocation of the proteins through the ER 
membrane. The foldase proteins like protein disuiphide isomerase and Its homologs and 
proSyl-peptsdyi eisrtrans isomerase assist in formation of disuiphlde bridges and formation of 
the right conformation <of the peptide chain adjaeentfo proline residues, respectively, A 
machinery including many protein components also resides in the ER for the addition of the 
N-iinked core gfycans to the secretory protein and for the Initial trimming steps of the giycans, 

The levels of the chaperone and fokfas© proteins found in the ER are regulated at the 
transcriptional level For each gene there is a basic level of transcription that can be 
increased in response to various stimul A large amount of secretory protein In the ER 
{secretory load) can induce the mammalian GRF78 gene, and this induction is mediated 
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through the HF~xB transcription factor (Pah! and BaeuerlalSSS, EMBO J, 14:2580-2588). 
Furthermore, the ER chaperone and foidase genes are upregulated when the amount of 
unfolded protein increases in the ER This Induction has been named unfolded protein 
response (UPR) and it has been described In mammalian cells, yeast and filamentous fungi 
(McMillan etat, 1994, Cum Opinion in Btotedinei 5:540-545), The induction can be caused 
by treatment of cells with reducing agents like DTT, by Inhibitors of core giycosyiafion like 
tunikamycin or by Ca-ionophores that deplete the ER calcium stores. The promoters of 
mammalian and yeast genes regulated by UPR have a conserved sequence region called 
UPR element, where the transcription factor responsible for the induction binds. 

When the unfolded protein response pathway is active, a signal is tranduced from the 
ER lumen to the transcription machinery in the nucleus. A protein implicated: In the UPR 
induction is the IRE1 protein of yeast (Cox et at., 1 993, GeJ 73:1 1 97-1 208, Mori si aL, 1 993, 
Cs« 74i'1,43-1 56). It Is large protein having a transmembrane segment anchoring the protein 
to the ER membrane. A segment of the IRE1 protein has homology to protein kinases and 
the C-fermfnai tall has some homology to RNAses, it is believed that the IRE1 protein may 
ha the first component of the UPR signal transduction pathway, sensing the ER lumen for 
the presence of unfolded proteins and transmitting the signal eventually to a transcription 
factor inducing the ER-proteln genes, it has been reported that the {RSI protein 
oiigomerizes and gets phosphoryiated when the UPR is activated (Shamu and Walter, 1096, 
EMBOJv 1 5:3028-3039). Over-expression of the IRE1 gene in yeast leads to constitutive 
induction of the UPR (!&)> Phosphorylation of the IRE1 protein occurs at specific serine or 
threonine residues in the protein, 

Another protein reportedly Implicated in the regulation of the UPR pathway is RTC2, a 
yeast protein phosphatase encoded by the PTC2 gene {Welihinda etat, 1998, Mol Cell. 
Biol. 18, 1987-1977), The IRE1 protein is phosphoryiated when the UPR pathway is turned 
on (Shamu and Walter, 1998, BUBO J. 1 5:3928*3038), and PTC2 dephosphoryiates the 
IRE1 protein and regulates the UPR, 

It has further been reported that the yeast transcription factor mediating the UPR 
induction of the chaperons and foidase genes is the HAC1 protein (Cox and Walter, 1996, 
Cell 87:391 -404, Sidraysk) eta!,, 1998, Sell 87:405-413). It belongs to the bZIP family of 
transcription factors, having a basic DNA-binding region and a leucine zipper dlmerisation 
domain. The binding of the HAC1 pmtein to the UPR element of ER-profein gene promoters 
has been demonstrated (Mod et aL, 1998, J. Biol. Cham, 273: 9912-9920). The action of the 
HACl protein is regulated by its amount in the cells; none of the protein can be found in 
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uninduced cells and upon UPR induction it appears rapidly. The HAC1 protein amount is 
dependent of the splicing of the respective m&NA, In unindueed conditions the intron 
present in the HACi gene dose to the translation termination codon is not spliced off, and 
this intron prevents the formation of HAG1 protein by preventing the translation of the mRNA 
(Chapman and Walter, 1937, Cum Biol, ?, S50*8§8, Kawahara et at, 1997, Mol Biol. Cell a, 
1 845-1882). When UPR is induced, the Intron is spliced and the mRNA Is translated to form 
BAG 1 protein that activates the promoters of Its target genes. The HACi intron is spliced by 
an mechanism not currently described for any other system, involving the RNAse activity of 
the IRE1 protein and a tRNA isgase {Sidrauski and Walter, 1997, Cejf90 ( 1031-1039, 
Gonzatesetat, 1999, EMBO J. 18,8119-3132. Sidrauski et at, 1998, Cej| 87, 485-413), 
The unfolded protein response can be Induced constltuttvely in yeast by transformation with 
a UPR inducing version of the HACI gene. (Cox and Walter, supra.) 

Thus, as indicated above, there are a number of reports regarding the secretory 
pathway, Additionally, there are reports on how to increase secretion so as to provide 
greater yields of heterologous proteins. Greater yields of protein are generally of ■interest to 
industry to provide more o? a particular protein and to facilitate purification. 

For example, In one report random mutagenesis of the host organism has been 
performed followed by screening for increased yield of a secreted protein. In another report, 
there has been fusion of a heterologous protein to an efficiently secreted endogenous 
protein in order to Increase the yield of secretion of the heterologous protein. Both of these 
methods have been of limited success and other methods to improve protein secretion are 
desirable. 

In other studies, there has reportedly been increased yields of secreted heterologous 
proteins In yeast by either over-expression or deletion of the yeast ER foldase or chaperons 
genes on an individual or pairwtee basis. For example, over-expression of either the protein 
dlsuSphide isomerase (PDI) or the KAR2 (homologous to the gene for the mammalian ER 
ehapsrone BP) genes In yeast has been shown to increase the extracellular accumulation of 
certain secreted heterologous proteins (Robinson etal, 1996, Bio/Technology, 12:381-384; 
Harmsen, et al, 1998, Appi. Miorohtol, Siotechnot, 48:385-370). In contrast deletion of the 
CNE1 gene, encoding an ER chaperone homologous to mammalian cainexin, reportedly can 
lead to increased secretion of a heterologous protein (Pariati etal, 1995, J, Biot Chem. 
270:244-253, Harmsen, supra.}. The effect of over-expression or deletion of individual or 
pairs of ER ehaperonss or foldases has also been reported on in filamentous fungi, however, 
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increasetil secretion was not obtained. (Pant, ef aL, 1998, Appi. Microbiol Biotech, 50:447- 
454; Wang, et al, 2000, Current ^nefics, 37:S7-84), 

Therefore, It Is desirable to provide new methods to increase production of secreted 
proteins in eukaryotlc ceils which are simple and consistent it Is also desirable to provide 
compositions such as novel genes to be used in methods for the increased production of 
secreted proteins, Si is further desirable to provide eukaryotic ceils according to the invention 
which are transformed with heterologous genes so as to have an increased capacity to 
produce secreted proteins. 

Summary of the Invention 

Provided herein are methods tor increasing the secretion of a heterologous protein in 
a cell comprising inducing an elevated unfolded protein response (UPR), The increase in 
protein secretion is compared to a level of protein secreted by the ceil when the UPR is not 
elevated by the methods described herein. In one aspect, the method includes inducing the 
elevated UPR by increasing the presence of the HA01 protein in the ceil. In one aspect Of 
the invention, the presence of the HAC1 protein can he increased by a number of methods. 
For example, the HAC1 gene can be overexpressed compared to its native state, 
Overexpression can be achieved by a variety of ways including the use of preferred vectors 
and promoters as further described herein. In one embodiment, the HAC1 protein is 
increased in a cell by transformation of said cell by a nucleic acid comprising a UPR Inducing 
form of a HAC1 recombinant nucleic acid, 

The HACl nucleic acid encoding a H.AC1 protein can be from a variety of sources, It 
is understood that in one embodiment, HAC1 Is used interchangeably with had , haoA, etc,, 
and one embodiment is meant to encompass HAC1 hornoiogs. Additionally, the skilled 
artisan can ascertain by the context whether the HACl is a nucleic acid, protein or either, in 
one embodiment, a HAC1 nucleic acid is isolated from yeast, in another embodiment, a 
HACl nucleic acid Is isolated from filamentous fungi such as Thoftodemw or Aspergillus. 

In another aspect of the invention, the elevated UPR Is induced by modulating the 
levels of {BE1 protein or PTC2 protein in said cell Nucleic acids encoding IRE1 or PTC2 
can be isolated from yeast or filamentous fungi such as TrichQdemm or Aspergillus, In a 
preferred embodiment the nucleic acid encoding 1RE1 or PTC2 is isolated from 7". mesei, A 
mdulsns or A. niger, 
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The eel! from which the protein Is secreted can be any cell having an UPR, Ceils 
having an UPR include ail eukaryotes Including but not limited to mammalian ceils, insect 
ceils, yeast and filamentous fungi 

Also provided herein is an isolated nucleic acid encoding a HAC1 protein, wherein 
said HAC1 has unfolded protein response inducing activity and has less than 50% similarity 
to yeast HAC1 protein, in another embodiment an Isolated nucleic acid encoding a HAC1 
protein is provided, wherein said HAC1 protein has unfolded protein response Inducing 
activity and wherein said HAC1 comprises a DNA binding region that has greater than 70% 
similarity to the DNA binding region of filamentous fungi HAC1 . Embodiments of a DNA 
binding region are shown at amino acids 84-14? of the T, reeset protein shown in Figure 10, 
at amino acids 53-1 1 6 of the A &$atan$ protein shown in Figure 10, and at amino acids 45- 
1 09 of the A nig&r protein shown in Figure 28. in one embodiment, the HACl protein 
encoded by the HACl nucleic add provided herein has an amino acid sequence having 
greater than 70% similarity to the sequence of Figure 7, Figure 8 or Figure 28, The proteins 
encoded by such nucleic acids are also provided herein. 

In one embodiment, the nucleic acid provided herein encodes an amino acid 
sequence as set forth in Figure 7, Figure 8 or Figure 28. hi yet another embodiment, the 
nucleic acid provided herein has a nucleic acid sequence as set forth in Figure 7, Figure 8 or 
Figure 28. The proteins encoded by such nucleic adds are also provided herein. 

Further provided herein Is an isolated nucleic acid encoding a FTCS protein wherein 
said PTC2 protein modulates unfolded protein response and wherein said FTC2 protein has 
at least 70% similarity to the amino add sequence of Figure 24 or Figure 25. In preferred 
embodiments the PTC2 protein has preferably at least 80%, more preferably at least 90% or 
more preferably at least 95% similarity to said amino acid sequences. In one aspect, the 
PTC2 protein has an amino acid sequence as set forth in Figure 24 or Figure 25. In another 
aspect the PTC2 nucleic acid has a nucleic acid sequence as set forth in Figure 24 or 
Figure 25, The proteins encoded by such nucleic acids are also provided herein. It is 
understood that as used herein, PTG2 can be used interchangeably with ptc2 and pteS, and 
that In one embodiment, PTC2 encompasses homoiogs. Moreover, the context in which the 
term is used will determine whether PTC2 is a nucleic acid, a protein or either. 

Also provided herein is a nucleic aosd encoding an IRE1 protein having unfolded 
protein response modulating activity arid having at least 80% similarity to an amino acid 
sequence as shown in Figure 26 or Figure 27. in preferred embodiments the IRE1 protein 
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has at least 70%, preferably at least 80%, more preferably at least 00% or -even more 
preferably at least 85% similarity to said amino acid sequences. In one aspect, SRE1 has an 
amino acid or nucleic acid sequence as shewn in Figure 26 or Figure 27, it Is understood 
that as used herein, 1RE1, feel and IrsA can be used Interchangeably, and that in one 
embodiment, 1RE1 Includes homologs, Moreover, the context in which the term is used will 
determine whether IRE1 is a nucleic acid, a protein or either. 

The nucleic acids provided herein may be obtained from a variety of sources 
including but not limited to filamentous fungi such as Trichodetma and Aspergillus, in a 
preferred embodiment the nucleic- acids are obtained from T, r&es&L. A. nidulans or A niger. 

Also provided herein is a cell containing a heterologous nucleic acid encoding a 
protein having unfolded protein response modulating activity and a heterologous nucleic acid 
encoding a protein of interest to be secreted. In one aspect said protein having unfolded 
protein response modulating activity fs selected from the group consisting of HAC1 , PTC2 
and IRE1 . in another embodiment, said protein of interest is selected from the group 
consisting of lipase, ceifuiase, endo-giucosidase H, protease, earbohydrase, reductase, 
oxidase, isomeraae, transferase, kinase, phosphatase, aipha-amylase, gfucoarnyiase, 
lignocellulose hemiceiiulase, pectinase and ligninase. 

Further aspects of the invention will be understood by the skilled artisan as further 
described below. 

Brief Desc ription of the Figures 

Figure 1 depicts a map of the piasmid pMS109, an embodiment of a plasmid 
constructed for the expression of the truncated yeast HAC1 gene. 

Figure 2 depicts a graph showing activity of o-amylase produced from yeast 
containing pMS109 (squares) or an empty control vector pKKl (diamonds) in the vertical bar. 
overtime, horizontal bar, and further showing the activity is greater wherein p!v1S1Q9 is 
present. 

Figure 3 depicts a bar graph showing activity of Invertase produced from yeast 
containing pMS1 09 (black bars) or" ah empty control vector pKK1 (shaded bars) In the 
vertical bar, over time, horizontal bar, and further showing the activity is greater wherein 
pMSlOO is present. 
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Figure 4 depicts a graph showing activity of d-amyiase produced from yeast wherein 
RAC1 has been disrupted (diamonds) or from Its parental control strain (squares) in the 
vertical har s over time, horizontal oar. and further showing that the activity Is greater wherein 
HAG1 has not been disrupted, 

Figure 5 depicts a graph showing activity of Trichodmma rmsei(T. R&e$&i) 
endogiucanase SGI produced from yeast wherein HAC1 has been disrupted (diamonds) or 
from its parental control strain (squares) in the vertical bar, over time, horizontal bar, and 
further showing thai the activity Is greater wherein HAC1 has not been disrupted. 

Figure & depicts a map of the plasmid pMS1 19. where the full-length 7. reeee/HACI 
cDNA without the SO bp intron is in the pBluescrlpt SIC vector. 

Figure 7 depicts an embodiment of a nucleotide (SEQ ID No, 1) and deduced amino 
acid sequence (SEQ ID No, 2) ot T> reese/HACI. The Introns are shown in lower case 
letters. 

Figure 8 depicts an embodiment of a nucleotide (SEQ ID No, 3) and deduced amino 
acid sequence (SEQ ID No. 4) of Aspergillus nidulans (A niduims) hacA. The introns are 
shown In lower case letters. 

figure 9 depicts the hakpin loop structures forming at the §' end of the 20 bp introns 
in the T. reesei HAG 1 and A mdulans hacA mRNAs and at the 3' end of the Intron of the S. 
c&revbias HAG1 mRNA. The conserved nucleotides in the loop region are shown In bold. 
The cleavage site of the yeast intron and the three possible cleavage sites of the 7* r&mei 
HAC1 intron are shown by arrows. Alignment of the 20 bp intron areas of the T, reesei 
HAC1 and A niduiam hacA is shown below. The intron is in lower case. 

Figure 10 depicts an amino acid sequence alignment of the T, mssei HAC1 » A, 
nldums hacA and $. cefavfsia& HAG1 proteins. Identical amino acids are shown by 
asterisks and similar ones by dots. Yeast HAC1 is homologous to the other sequences at 
the DMA binding domain area. The DNA binding domain Is approximately at amino acids 84- 
147 for T, reesei (SEQ ID No, 5), and approximately at amino acids 53-1 18 for A. mdulans 
(SEQ ID No. 8), 
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Figure 1 1 depicts Northern hybridlzatJen of RNA samples derived from 71 mesei 
myeelia treated with DTT !>OTT} and untreated control myeeiia (-DTT). The fimepoints (In 
minutes) after DTT addition are shown. The probes used for hybridization are shown on the 
left. 

Figure 12 depicts Northern hybndizstson of RHA samples derived from A nktuism 
myeslia treated with DTT C+DTT) and untreated control myeelia (-DTT), The timepoints alter 
DTT addition are shown. The probes are shown on the left. 

Figure 1 3 depicts a map of the piasmid pMS131 , where the full-length T, m&sei 
HAC1 cDNA without the 20 bp intron is under the yeast PGK1 promoter in the vector 
pAJ4Q1. 

Figure 14 depicts a map of the plasmid pMS13£ t whore the T. m&sw HAGI cDNA 
without the 5' flanking region and without the 20 bp intron is under the yeast PGK1 promoter 
In the vector pAJ401 , 

Figure 1 5 depicts complementation of S, G&r&i4sfa& HAC1 arid IBE1 disruptions 
(DHAC1 and DIRE 1, respectively) with different forms of the T. reese/ BACI cDNA, The 
growth of transformanfs on media with and without inositol is shown, pAJ401 is the 
expression vector without any insert, pMS131 has the full-length 7. reeseiHACI esBNAin 
pAJ4G1 . pMS1 32 has the T. reesei HAC1 oDNA without its 5* flanking region In pAJ4Q1 , 

Figure 16 depicts bandshift experiments, where the binding of the malE-HACI fusion 
protein to the putative UPR element sequences found in 7. reesmpdii and i>ip1 promoters 
was tested. The oligonucleotides used in the binding reactions are shown on the top. Lanes 
1, 12 and 16, no protein; lanes 2, 4-7, 8-11, 13-15 and 17-19, malE-HACI fusion protein; lane 
3, rnalE protein alone. The binding was competed with uniabeiied oligonucleotides on lanes 
5 (20 x excess); lanes 6, 1 0, 14 and 18 (SOX excess) and lanes 7, 11, 18, and 1 9 {200 x 
excess). Alignment of the UPR element sequences that hind the HACI-malE protein is 
shown below. 

Figure 17 depicts a graph which shows actrvity of o-amylase by yeast strains 
expressing the 7. imsel HAC1 cDNA without the §' banking region and the 20 bp intron 
(pMS132) (squares) and control strains with the expressjon vector alone (pA j401) 
(diamonds) in the vertical bar over time, horizontal bar, and which further shows that activity 
is greater wherein pfv1S132 is present. 
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Figure 1 8 depicts a bar graph which shows activity of frwertase by yeas! strains 
expressing the 71 rms&t HAC1 ePNA without the 5' flanking region and the 20 bp iniron 
(pMS132) and control strains with the expression vector aione (pAJ401) In the vertical bar, 
over time (horizontal bar) and which f urther shows that activity is greater is greater wherein 
pMS132 Is present. 

Figure 19 depicts Northern hybridization of RNA samples from a yeast strain 
expressing the T. re&seiHAGt cDNA without the S ! flanking region and the 20 bp infron 
CpMSl 32) and a control strain with the expression vector alone (pAJ401). The probes used 
for hybridization are shown. The signals were quantified with a phospholmager and the 
KAR2 signal intensities were normalised with respect to the TDH1 signal intensities. The 
normalised KARS signals are shown on the bottom wherein it is shown that pMS1 32 has 
greater signal 

Figure 20 depicts a map of the plasmid pMS138, where the T. reeseiHApl cDNA 
without the 5' flanking region and the 20 bp intron is under the A. nidutens gpdA promoter in 
the vector pAN52-Notl. 

Figure 21 depicts Northern hybridization of RNA samples derived from transformation 
of the plasmid pMSi 36 into a T. mes&i strain producing CBNhchymosin fusion protein. 
Samples from the parental strain (lanes 1 , S and Q), two positive transformers (lanes 2 S 3, 6, 
7, 1 0 and 1 1 ) and a HAC1 mutant strain desginatecl number 31 generated in the 
transformation (lanes 4, 8 and 12} are shown. The growth times are shown on the fop and 
the probes used for the hybridization on the felt. Quantifications of the pdil and bipl signals 
normalised with respect to gpdl signals are shown on the bottom. 

Figure 22 depicts Northern hybndfeation of RNA samples derived from myeslia of the 
HAC1 mutant strain number 31 treated with DTT (*DTT) and untreated control mycella ( - 
DTT}. The flmepoints after DTT addition are shown on the top and the probes used for 
hybridization on the left. Quantifications of the pdil signals normalised with respect to gpdl 
signals are shown on the bottom. 

Figure 23 is a graph depicting production of calf ehyniosin by the HAC1 mutant 
fransfomiant number 31 {diamonds} and Its parental strain (squares) during a shake flask 
culture. The ehymosin (CHV) units per ml of culture are shown (vertical bar) over time 
(horizontal bar), and it is shown that the control has. more units than the mutant 
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Figure 24 depicts an embodiment of a nucleotide (SEQ ID No. 7) and deduced amino 
acid sequence (SEQ ID No, 8) of the fragment isolated from the A ntdukmspicB gene. The 
intron is shown In lower cass. 

Figure 25 depicts an embodiment of a nucfeoiide (SEQ ID No. 9} and deduced amino 
acid sequence (SEQ ID No. 10} of the T. t&es&l ptc2 cDNA, 

Figure 26 depicts an embodiment of a nucleotide (SEQ ID No. 11} and deduced 
amino acid sequence (SEQ ID No. 12) of the fragment isolated from the A, nidulam sreA 
gene. The intron is shown in lowercase. 

Figures 27A-27C depict an embodiment of a nucleotides (SEQ ID No, 13} and 
deduced amino acid sequence (SEQ ID No. 14} of the T. reese/lREl gene. The intron is 
shown in lower case. 

Figure 28A-280. The nucleotide (SEQ ID No. 15} and deduced amino acid sequencs 
(SEQ ID No, 16) of Aspergillus nlg&rv&t. awamorl hacA eDNA, The 20hp unconventional 
intron (SEQ ID No, 17} is shown in lower case Setters, The amino acid sequences of the 
upstream open reading frame (SEQ ID No, 18} and the HACA protein (SEQ ID No, 19) are 
shown below the nucleotide sequence. 

Figure 29. Map of the plasmid pMSl52 where the Aspergillus nlgervex, awamorl 
hacA without the 5' flanking region and the 20bp Intron Is under control of the Aspergillus 
nigervm, awamonglaA promoter. 

Figure SG.The levels of chymosin activity measured in supernatants from duplicate 
cultures of strain AAP3pUCpy/GRG3#1 1 (ctrt} and transfomnants (#1 , #2, #3 and #4) of this 
strain with p]MS1 52. 

Figure 31 ,The levels of iaeease activity measured in supernatants from duplicate 
cultures of strain AAP4rpGP7iaecase (ctrl) and transformants (#1 : #2, #3, #4 ; #5, #8 S #7 and 
#8) of this strain with pMS152. 
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Detailed Desedpffo o of the Invention 

The Invention \ml\ now be described in detail by way of ref erence only using the 
following definitions and examples. All patents and publications, inducting all sequences 
disclosed within such patents and publications, referred to herein are expressly incorporated 
by reference. 

Provided herein are methods and compositions for increasing the secretion of a 
protein in a ceil comprising inducing an elevated unfolded protein response (UPR). The 
compositions provided herein include nucleic acids, proteins, and cells. 

In one embodiment UPR refers to the unfolded protein response which occurs in 
response to an increase in unfolded protein In the ER, in a method provided herein, the 
UPR is elevated. In one embodiment, "elevated" UPR refers to an increase in the response 
compared to the response which would have been induced based on the amount of unfolded 
protein in tile ER. In one embodiment elevated refers to an increase with respect to the 
length of time the response occurs, in each embodiment, the elevated UPR results in an 
increased capacity for the cell to produce secreted proteins compared to another ceil of the 
same type containing the same amount of unfolded protein in the ER, Preferably, the cell 
having an elevated UPR in accordance with the present invention produces more secreted 
protein in the same amount of time as a cell not having an elevated UPR, 

In one aspect, the method includes Inducing the elevated UPR by modulating the 
amount or presence of one or more UPR modulating proteins in said cell, in one 
embodiment, the UPR modulating protein is selected from the group consisting of HAC1 , 
PTC2 or IRE1 . UPR modulating proteins are further discussed below, it Is understood that 
the modulating protein can be obtained by increasing the presence of a nucleic acid which 
encodes a modulating protein. The protein used in the methods herein have UPR 
modulating activity as further discussed below, and the nucleic acids encode a protein which 
has UPR modulating activity. Modulating means that an increase in the protein can lead to 
an increase or a decrease in the UPR, Thus, in one embodiment, the presence o! a 
modulating protein is increased as further discussed below to reach an elevated UPR, in 
another embodiment, the modulating protefnte decreased or eliminated to reach an elevated 
UPR, in a preferred embodiment HA01 and/or IRE1 are increased so as to reach an 
elevated UPR. 
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In oris embodiment, inducing UPR means that the unfolded protein response as a 
whole is induced or maintained as jtwo^fd-be'^^URfekS^-pfOt^ft in the ER, The unfolded 
protein response Involves Increased expression and regulation of multiple EB foldases and 
chaperones. Thus., In one embodiment, manipulation of ER foldases or chaperones on an 
individual gene basis would not be considered an induction of UPR, Thus, in a preferred 
embodiment, UPR modulating activity results In an elevated UPR wherein an elevated UPR 
results In upreguiatlon of ER chaperones and foldases and increased secretion of proteins. 

The nucleic acids encoding the UPR modulating proteins can he obtained from a 
variety of sources. Preferred organisms include but are not limited to SaccharcmyG&s 
mmvisiae, Aspergillus spp. and Trichoderma spp. Also other suitable yeasts and other 
fungi, such as Schlzosaccharomyc&s pombe, Ktuyveromyc&s lactis, Ptchta spp., Hamenula 
spp., Fusaiiiim spp., Neocosporaspp. and P&nictiliiimspp. can be used. Homologous 
genes from other organisms can also be used. In one aspect, homologous genes refer to 
genes which are related, but not identical, in their DNA sequence and/or perform the same 
function are homologous with each other and are called each other's homologies,. 

HA01, PTC2, or IRE1 amino acid and nucleic acid sequences have been described 
for yeast. For example, for RAC1 , see GenBank accession number E1 5694; for FTC2, see 
Gen Bank accession number U72498; for IRE'h see GenBank accession number 21 1701, 
Sequences of GenBank accession numbers are incorporated herein by reference, GenBank 
Is known In the art, see, e.g., Benson, DA, el at, Nucleic Acids Research 26:1-7 (1098) and 
http://www.ncbl.njm.nih.gov/. in one embodiment, HAC1, FTC2, or 1RE1 are Isolated from a 
species other than yeast, preferably a filamentous fungi, insect cell, mammalian ceil or other 
eukaroyote. Sequences for HAC1 are provided in Figures 7, 8 and 28. Sequences for 
FTC2 are provided in Figures 24 and 25. Sequences for IRE1 are provided in Figures 26 
and 27, 

In one embodiment, the UPR modulating sequences are identified by hybridization to 
other nucleic acids. Additionally, sequence homology determinations can be made using 
algorithms. 

Thus In one embodiment, the UPR modulating nucleic acid hybridizes to a 
complement of a nucleic acid encoding HAC1 , FTG2 or IRB1 . in one embodiment, the 
HAG1, FTC2 or IRE1 encoding sequence is selected from the sequences provided In the 
respective figures, in one embodiment the stringency conditions are moderate. In another 
embodiment, the conditions used are high stringency conditions. 
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"Stringency* of hybridteatiori reactions is readily determinable by one of ordinary skill 
in the art, and generally Is an empricai calculation dependent upon probe length, washing 
temperature, and salt concentration, fn general, longer probes require higher temperatures 
for proper annealing, while shorter probes need lower temperatures. Hybridization generally 
depends on the ability of denatured DNA to reanneal when complementary strands are 
present in an environment below their melting temperature. The higher the degree of 
desired homology between the probe and hyohdizabia sequence, the higher the relative 
temperature which can be used As a result it follows that higher relative temperatures 
would tend to make the reaction conditions more stringent, while lower temperatures less so. 
For additional details and explanation of stringency of hybridization reactions, see Ausubel 
at ah, Current Protocols in Molecular Biology. Wiley interseienee Publishers, (1995). 

"Stringent conditions" or "high stringency conditions 8 , as defined herein, may be 
Identified by those that; (1) employ low ionic strength and high temperature for washing, for 
example 0,015 U sodium chloride/0,0015 M sodium citrate/0.1% sodium dpdecyl sulfate at 
S0*G- (2) employ during hybridization a denaturing agent, such as formamide, for example, 
50% (v/v) formamide with 0.1% bovine serum aibumin/0.1% Ficoii/0,1% 
potyvlnylpyrrolidone/oOmrvl sodium phosphate buffer at pH 8.5 with 760 mM sodium chloride, 
75 mM sodium citrate at 42 S C; Of (3) employ 50% formamide, 5 xSSC $.75 M NaCS, 0.076 
M sodium citrate), 50 mM sodium phosphate {pH 6,8), 0,1% sodium pyrophosphate, 5 x 
Denhardt's solution, sonicated salmon sperm DNA (50 ug/ml), 0.1% SDS, end 10% dexfran 
sulfate at 42°C, with washes at 42*0 in 0.2 x SSC (sodium chloride/sodium citrate) and 50% 
formamide at 55°C S followed by a high-stringency wash consisting of 0.1 x SSC containing 
EDTA at 5S°C, 

"Moderately stringent conditions" may be identified as described by Sambrook ef at. 
Molecular Cloning: A Laboratory Manual New York: Cold Spring Harbor Press, 1 989, and 
include the use of washing solution and hybridization conditions (e.g., temperature, Ionic 
strength and %SDS) less stringent that those described above. An example of moderately 
stringent conditions is overnight incubation at 37*0 in a solution comprising: 20% 
formamide, 5 x SSC (150 mM NaCI, 15 mMtrisodiam citrate), 50 mM sodium phosphate (pH 
7,8), 5 x Denhardf s solution, 10% dextran sulfate, and 20 mg/ml denatured sheared salmon 
sperm DNA, followed by washing the filters in 1 x SSC at about 37-50*0. The skilled artisan 
will recognize how to adjust the temperature, ionic strength, etc, as necessary to 
accommodate factors such as probe length and the like. 
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Homologous {similar or M^<^); : ^U§rjee8',<^rt:iSlso be determined by using a 
"sequence comparison algorithm* Optimal alignment of sequences for comparison cm be 
conducted, e.g., by the local Homology algorithm of Smith & Waterman, Adv. AppL Math, 
2:482 (1981), by file homology alignment aigonthm of Needleman & Wunsoh, J, Mot BM 
48:443 (1 070), by the search for similarity method of Pearson & Upman, Proc. Nat! Acad, 
Scl USA 85:2444 {1988}, by computerised implementations of these algorithms (GAP, 
8ESTFIT, PASTA, and TFASTA In the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Dr., Madison, Wl), or by visual Inspection, 

An example of an algorithm that is suitable for determining sequence similarity Is the 
BLAST algorithm, which is described in Aitschui, ©fa/,, J. Mol. Biol. 215:403-410 (1990), 
Software for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information (hf|pi//www.nebl.film.nih.gov/). This algorithm involves first 
identifying high scoring sequence pairs (HSPs) by identifying short words of length W In the 
query sequence that either match or satisfy some positive-valued threshold score T when 
aligned with a word of the same length fh a database sequence. These initial neighborhood 
Word hits act as starting points to find longer HSPs containing them. The woitj hits are 
expanded in both directions along each of the two sequences being compared for as far as 
the cumulative alignment score can be increased. Extension of the word hits Is stopped 
when: the cumulative alignment score falls off by the quantity X from a maximum achieved 
value; the cumulative score goes to zero or beiow; Or the end of either sequence Is reached. 
The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the 
alignment, The BLAST program uses as defaults a wordlength (W) of 1 1 , the 8LOSUM62 
scoring matrix (see Hen&off & Henikoff, Pkxx Natl. Acad. Sol USA 89:10815 (1989)} 
alignments (9) of 50, expectation (E) of 10, U~S t N--4, and a comparison of both strands. 

The BLAST algorithm then performs a statistical analysis of the similarity between 
two sequences {.see, e.g., Karfirt & Altschui, Proa Natl Acad. ScL USA 90:5873-5787 
(1093)). One measure of similarity provided fey fhe BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, an amino 
acid sequence is considered similar to a protein such as a protease if the smallest sum 
probability In a comparison of the test amino acid sequence to a protein such as a protease 
amino acid sequence is less than about 0.1 more preferably less than about 0.01, and most 
preferably less than about 0.001 . 
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\n one embodiment, the HAC1 protem provided herein has less than 80% sequence 
similarity than the HAC1 yeast protein, see for example, GeoBank accession number 
E15694, more preferably, less than 70%, more preferably, less than 80%, more preferably 
less than 50%, more preferably, less, than 45% or 40% similarity, in another embodiment 
identity is substituted for similarity. 

in another embodiment the BAC1 protein provided herein has at least 40% similarity 
to the amino acid sequence set forth in Figure 7 or Figure 8. More preferably, the similarity is 
at least S0%, more preferably, at least 60%, more preferably at least 70%, more preferably at 
least 80%, more preferably at least 90%, and more preferably at least 95% or 98%. In 
another embodiment identity is substituted for similarity 

lb another embodiment, the HAC1 protein provided herein comprises a DMA binding 
domain that has at least 70% similarity to the DMA binding domain set forth in Figure 10. 
yore preferably, the similarity is at least 70%, more preferably at least 80%, more preferably 
at least 90%, and more preferably at least 95% or 98%. in another embodiment, identity Is 
substituted for similarity. 

As used herein, DMA binding domain refers to the domain which binds to the 
conserved sequence called the UPR element in promoters of genes regulated by UPR, 
Embodiments of a DHA binding region are shown approximately at amino acids 84-1 47 of 
the 7. rms&i protein shown In Figure 1 0, approximately at amino acids 53-1 18 of the A, 
ntdutam protein shown in Figure 10 and approximately amino acids 45-109 of the A, niger 
protein shown in Figure 28. HAC1 homoiogs will have DMA binding domains which can be 
identified by activity or by alignment to the binding domains in Figure 10. 

in one embodiment, the PTC2 protein provided herein has less than 80% sequence 
similarity than the PTC2 yeast protein, see for example, GenBank accession number 
U472498, more preferably, less titan 70%. more preferably, less than 80%, more preferably 
less than 50%, more preferably, less than 45% or 40% similarity. In another embodiment, 
Identity is substituted for similarity. 

In another embodiment, the PTC2 protein provided herein has at least 40% similarity 
to the amino acid sequence set forth In Figure 24 or Figure 25, More preferably, the 
similarity is at least 50%, mere preferably, at least 60%, more preferably at least 70%, more 
preferably at least 80%, more preferably at least 90%, and more preferably at least 95% or 
98%. In another embodiment, identity is substituted for "similarity. 
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In one embodiment, the iRE1 protein provided herein has less than 80% sequence 
similarity than the I REl yeast protein, see for example, GenSank accession number 21 1 701 , 
more preferably, less than 70%, more preferably, less than 60%, more preferably less than 
50%, mors preferably, less than 45% or 40% similarity. In another embodiment, identity is 
substituted for similarity. 

In another embodiment, the IRE1 protein provided herein has at least 40% similarity 
to the amino acid sequence set forth in Figure 28 or Figure 27. More preferably, the 
similarity is at least 50%, more preferably, at least 60%, more preferably at least 70%, more 
preferably at least 80%, more preferably at least 00%, and more preferably at least 05% or 
98%. in another embodiment, Identity is substituted for similarity. 

Additionally, further homologs of the UPR modulating sequences can be identified for 
example by using PGR primers based on the sequences provided herein, in yet another 
embodiment, naturally occurring allelic variants of the sequences provided herein may be 
used, 

A protein: has UPR modulating activity if if Is able to regulate the induction of UPR, 
Regulate means causing an increase or decrease in the induction of the UPR, A UPR 
modulating protein can increase or decrease UPR induction whether or not there is a change 
in the amount unfolded protein In the ER. In a preferred embodiment, a UPR modulating 
protein has one or more of the following activities; HAC1 activity, PTC2 activity. IRE1 activity, 
or binds to HAC1, 

Modulating the amount of or activity of the UPR modulating protein can occur by a 
variety Of methods. For example, to Increase the presence or activity of a protein in a ceil, 
one can over-express the nucleic acid encoding the UPR modulating protein. Over- 
expression as used herein means that the protein encoded by the said gene is produced in 
increased amounts in the cell, in one embodiment, over-expression can be used 
Interchangeably with constitutive expression or upreguiatfon. This can he achieved by 
increasing the copy number of the gene by introducing extra copies of the gene into the call 
on a plasmld or integrated into the genome. Over-expression can also be achieved by 
placing the gene under a promoter stronger than its own promoter. The amount of the 
protein in the cell can he varied by varying the copy number of the gene and/or the strength 
of the promoter used for the expression. Thus, manipulation of genes to cause induction of 
UPR may Involve insertion Into the host of multiple copies of a gene with its native promoter 
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either on a replicating autosomal plasmid or by integration into the chromosomal DNA, It 
may involve fusion of the gen© wife a promoter region and/or transcriptional control 
sequences from other genes to further increase expression or to allow controlled, inducible 
expression. Agonists and enhancers may also be used. 

In me case where It is desired to reduce me activity of a UPR modulating protein to 
result in elevated UPR, a number of methods may foe used such as deletion of a gene or the 
use of antisense nucleic acids to reduce the expression of a gene, if may involve alteration 
of a gone to provide a mutant form of the protein or Include the use of an Inhibitor of a UPR 
modulating protein. 

In one embodiment UPR is elevated by using a UPR inducing form of a recombinant 
nucleic aotd encoding a UPR-modulaimg protein, in one embodiment, a UPR-inductng form 
of a recombinant nucleic acid encoding a UPR-mociuiating protein is a nucleic acid which 
has been modified to give rise to a translatable mRNA, The translatable form mimics the 
modified mRNA which appears in the cell on induction of UPR and which can be translated 
to an active UPR- modulating protein. 

In one embodiment, a UPR~induclng form of a recombinant nucleic acid includes 
coding sequence. Coding sequence as used herein includes the nucleic acid sequence 
which leads to the amino acid sequence of the protein in its active form. As used herein, a 
nucleic acid consisting essentially o? a coding sequence explicitly excludes, lacks or omits at 
least infernal sequence which does not get translated when the active protein Is encoded. 
Infernal sequence as used herein refers to sequence which is internal to the carfcoxyt 
terminus and the amino terminus. Examples of excluded internal sequence are shown in 
small letters in Figures ?, 8, 24, 28, 27 and 28. The sequence may he excluded by deletion 
or truncation by methods known in the art. 

in one embodiment a nucleic aeM comprises a sequence consisting essentially of 
coding sequence. In this embodiment, the nucleic add may comprise vector sequence on 
eithe r side of the coding sequence but the coding sequence excludes internal sequence 
which does not get translated In the encoded protein's active form. 

in another embodiment,, a UPR modulating protein Is a variant UPR modulating 
protein which has been varied to have increased activity. Thus in one embodiment, the 
activity of a UPR modulating protein is Increased to elevate UPR. In one embodiment, the 
activity of a UPR modulating protein is increased by maintaining the protein in its active state. 
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For example, IRE1 is phosphoryiated when the UPR pathway is turned on. Therefore, In 
one embodiment herein, maintaining 1R£i in its phosphorated induces an elevated UPR 

In a preferred embodiment 1RE1 is mutated so as to constitutfveiy have the activity of 
phosphoryiated IRE1 . In one embodiment, serine and/or threonine residues are substituted 
with asparticacid to form to form an IRE1 variant having constitutive UPR Inducing activity*. 
Other substitutions to mimic a protein in its phosphoryiated stats are known In the art. 
Preferably, the mutations are performed on the nucleic acid encoding the protein. 

By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed 
in vitro, in general by the manipulation of nucleic acid by polymerases and endonucieases, 
In a form not normally found In nature. Generally, a nucleic acid refers to DMA, RNA or 
mRNA and includes a gene or gene fragment. Thus, an isolated nucleic acid, in a linear 
form, or an expression vector formed in vitro by iigatfng DMA molecules that are not normally 
joined, are both considered recombinant for the purposes of this invention. It is understood 
that once a recombinant nucleic acid is made and reintroduced into a host cell or organism, It 
wi replicate non-recombinantiy, i.e. using the in wVo cellular machinery of the host del! 
rather than m vitro manipulations; however, such nucleic adds, once produced 
recdmbihantly, although subsequently replicated non-recombinantly, are Still considered 
recombinant for the purposes of the Invention. 

Similarly, a "recombinant protein* is a protein made using recombinant techniques;, 
La. through the expression of a recombinant nucleic acid as depicted above, Generally, the 
term protein and peptide can be used interchangeably herein, A recombinant protein is 
distinguished from naturally occurring protein by at least one or more characteristics. For 
example, the protein may be isolated or purified away from some or ail of the proteins and 
compounds with which It is normally associated in its wild type host, and thus may be 
substantially pure. For example, an isolated protein is unaccompanied by at least some of 
the materia! with which it is normally associated in its natural state, preferably constituting at 
least about 0,5%, more preferably at least about 8% by weight of the total protein in a given 
sample. A .substantially pure protein comprises at least about 75% by weight of the total 
protein, with at least about 80% being preferred, and at least about 90% being particularly 
preferred. In one embodiment, the definition includes the production of a protein from other 
than its host cell, or produced by a recombinant' nucleic acid. Alternatively, the protein may 
be made at a significantly higher concentration than Is normally seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 
concentration levels. Alternatively, the protein may be in a form not normally found in nature, 
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as in the addition of an epitope tag or amino acid substitutions, insertions and deletions, as 
discussed below, 

A recombinant ceil generally refers to a cetwliich has been manipulated to contain a 
recombinant nucleic acid or protein therein. 

The protein of interest to be secreted can fee any protein. Wherein the protein is not 
naturally secreted, the nucleic acid encoding the protein may be modified to have a signal 
sequence in accordance with techniques known in the art. The proteins which are secreted 
may be endogenous proteins which are expressed naturally, but in a greater amount in 
accordance with the present. Invention, or the proteins may be heterologous. In a. preferred 
embodiment, the proteins are heterologous. Heterologous as used herein means the protein 
Is produced by recombinant means. Therefore, the protein may he native to the cell, but is 
produced, for example, by transformation with a self replicating vector containing the nucleic 
acid encoding the protein of Interest. Alternatively, recombinant could be wherein one or 
more extra copies of the nucleic acid are Integrated Into the genome by recombinant 
techniques, 

in another embodiment, the protein of interest is selected from the group consisting 
of lipase, celluiase, endo-glucositiase H, protease, carbohydrase, reductase, oxidase, 
isomerase, transferase, kinase, phosphatase, afpba-amyiase, glucoamyiase, IfgnoceSfulose 
hemieeiSuSass, pectinase and iigninase. In another embodiment, the protein of interest is a 
therapeutic selected from the group consisting of vaccines, cytokines, receptors, antibodies, 
hormones, and factors Including growth factors. 

The cell in which the proteins are secreted is any cell having an upregulated protein 
response. Preferably, the host to be transformed with the genes of the Invention can be any 
eukaryofie cell suitable for foreign or endogenous protein production, e.g., any 8, cemiske 
yeast strain, (e.g., DBY746, 8MA64-1A, AH22, S1S0-28, (3YPY55-1 5bA, vtUa-6301 5) any 
Ttichodertm spp. such as T. kmgfbmeftkfam.em&'fa& T. reeser strains derived from the 
natural isolate QM8a, such as RUTC-30, RL-P37, QM9416 and V1T-D-79125, any 
Kiuymromyms spp/. Sob. pomim, H, polymorpha, Pjchm, Aspergillus, Neumspora, 
Y&rmwla, Fu&arium, Penmllsum spp, or higher eukaryofie: eeiis. 

Examples of mammalian host cell lines include Chinese hamster ovary (CHO) and 
COS ceils, Sytore specific examples include monkey kidney CV1 line transformed by SV40 
(COS-7, ATCC OBL 1851); human embryonic kidney line (293 or 293 ceils subcloned for 
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growth in suspension culture, Graham at at, J. Gen ViroL, 36:59 {1977}); Chinese hamster 
ovary eellsADHFR (CHO, Uriaub and Chasm, Pma. Natl. Acad. Sol. USA, 77:4216(1980)); 
mouse Sertoli ceils (TM4, father, BietReprod., 23:243-251 (1980)): human lung cells 
(W138, ATCC CCL 75); human liver cafe (Hep Q2, HB 8065); and mouse mammary tumor 
(MMT 060562, ATCC CCL51). 

In an alternative embodiment, a plant ceil can be utilized. In another embodiment, a 
baeuiovirus infected insect ceil is utilized. The selection of the appropriate host cell is 
deemed to be within the skill In the art. 

Transfer of the genes Into these ceils can be achieved, for instance, by using the 
conventional methods of transformation described for these organisms. General aspects of 
mammalian cell host system transfecfions have been described in U.S. Patent No. 
4,399,21 8. Transformations Into yeast are typically carried out according to the method of 
Van Spiingen et at, J. 8act„ 130:946 (1977) and Hsiao et al„ Proc. Natl. Acad. Set ("USA), 
76:3829 (1 979), However, other methods for introducing DNA Into ceils, such as by nuclear 
microinjection, electroporation, etc. For various techniques for transforming mammalian 
cells, see Keown ©t at, Methods in Enzvrooloav, 18S:52?-S37 (1 990) and Mansour et at, 
Nature , 336:348-352 (1988), 

The nucleic acid (e.g., cONA. coding or genomic DNA) encoding the UPR 
modulating protein may be Inserted into a repltcahle vector. Various vectors are publicly 
available. The vector may, for example, he In the form of a plasmid, cosmic, viral particle, or 
phage. The appropriate nucleic acid sequence may be inserted into the vector by a variety 
of procedures. In general, DNA is inserted into an appropriate restriction encSonueiease 
sste(s) using techniques known in the art. Vector components generally include, but are not 
limited to, one or mora of a signal sequence, an origin of replication, one or more marker 
genes, an enhancer element, a promoter, and a transcription termination sequence. 
Construction of suitable vectors containing one or more of these components employs 
standard ligation techniques which are known to the skilled artisan. 

For yeast secretion the signa! sequence may be, e.g., the yeast invertase leader, 
alpha factor leader (including Saech&mmyms and Kfuywmmyces abactor leaders, the latter 
described in U.S. Patent No. 6,010,182), or acid phosphatase leader, the C. albicans 
glucoamyiase leader (EP 362,179 published 4 April 1090), or the signal described in WO 
90/13646 published 15 November 1990. in mammalian ceil expression, mammalian signal 
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sequences may be used to direct secretion of the protein, such as signal sequences from 
secreted polypeptides of the same or related species, as well as viral secretory leaders. 

Examples of suitable promoting sequences for use with yeast hosts include the 
promoters for 3-phssphogiycerate Kinase fHiteeman et at, J. Biol. Qhem,, 255:2073 (1880)] 
or other glycolytic enzymes [Hess et al„ J, Adv, Enzyme Reg,, 7:148 {1968); Holland, 
Biochemistry. 17:4900 {1978)3, such as enoiase, giyceraidehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofruetokinase, glucose-8- 
phosphate isonwase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase. phosphoglucoss ssomerase, and giueokinase, 

Other yeast promoters, which are inducible promoters having the additional 
advantage of transcription controlled by growth conditions, are the promoter regions for 
alcohol dehydrogenase 2, isocytoehrome C, acid phosphatase, degradative enzymes 
associated with nitrogen metabolism, matallothSonein, giyceraidehyde^S-phospbate 
dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable 
vectors and promoters for use in yeast expression are further described in EF 73,857, 

Transcription from vectors in mammalian host cells can be controlled, for example, by 
promoters obtained from the genomes of viruses such as polyoma virus, fowipox virus (UK 
2,21 1,504 published 5 July 1989), adenovirus (such as Adenovirus 2), bovine papilloma 
virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatifis-8 virus and Simian Virus 
40 {SV40)» from heterologous mammalian promoters, e.g., the actin promoter or an 
immunoglobulin promoter,, and from heat-shock promoters, provided such promoters are 
compatible with the host ceil systems. 

Transcription of a DMA encoding the protein in eufcaryotes may be increased by 
Inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of DMA, 
usually about from 10 to 300 bp, that acton a promoter to increase its transcription. Many 
enhancer sequences are now known from mammalian genes (giobln, elastase, albumin, a- 
fetoprotein, and Insulin). Typically, however, one will use an enhancer from a eukaryotic eel 
virus. Examples include the SV40 enhancer on the fate side of the replication origin (bp 100- 
270), the cytomegalovirus early promoter enhancer, the polyoma enhancer on the late side 
of the replication origin, and adenovirus enhancers. The enhancer may be spliced into the 
vector at a position 5 ( or 3' to the coding sequence, but is preferably located at a site 5' from 
the promoter. 
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Expression vectors used in eokaryotio host cells (yeast, fungi, insect, plant, animal, 
human, or nucleated cells from other multicellular organisms) will also contain sequences 
necessary for the termination of traosenptloo and for stabilizing the mRNA. Such sequences 
are commonly available from the S' and, occasionally 3', untranslated regions of eukaryotic 
or viral DMAs or cDNAs, These regions contain nucleotide segments transcribed as 
polyaclenylated fragments in the untranslated portion of the mRNA encoding the protein. Still 
other methods, vectors, and host cells are described in Gething et ai., Nature, 293:820-625 
(1981); Mantel et aL, Nature . 281:40-46 (1979); BP 117,080; and EP 117,058. 

In one embodiment, the gene is cloned into a suitable expression vector, such as 
pKKt or similar vectors comprising the appropriate regulatory regions depending on the 
selected host. For example, these regulatory regions can be obtained from yeast genes such 
as the ADH1, GAL1 - GAU0, PQK1, CUPt&AP, CYC1, PHOS, or asparagina synthetase 
gene, for instance. Alternatively, also the regulatory regions of, for example, HAC1 can be 
used to express the gene in B, cemvbim. The piasmici carrying the gene is .capable of 
replicating autonomously when transformed into the recipient yeast strain and Is maintained 
stably in a single copy due to the presence of a yeast centromeric sequence* Alternatively, a 
multicopy replicating piasrnsd could be used or integration of the plasmid into the yeast 
genomic DN A could be provided for using methods known in the art 

in one embodiment herein, to express HAOI cDNA, preferably truncated in 
Trich&derma the coding region of the inducing form of the Trhhodmma HAC1 gene is 
coupled for instance between the A. nidulms gptiA promoter and terminator and the 
expression cassette is transformed Into a Trichodemia strain producing for instance bovine 
chymosin or another foreign protein. In the truncated form, the unconventional introns are 
removed, as well as any remaining terminal end adjacent to said Intron. An uneovenfiooal 
intron is one which Is present in the mRNA In the ceil which Is not undergoing UPR, but 
which is removed from the mRNA upon induction of the UPR, UPR would be thus induced 
eonstiiutively, A higher level of expression which was Inducible according to the carbon 
source used for growth of the fungus could be achieved by fusion of the Inducing form of 
HAC1 with the promoter of the T, rms&icbhl gene. 

For filamentous fungi the HA01 gene is preferably integrated into the genome using 
methods known in the art. Suitable promoters in addition to the gpdA or cbhl promoters or 
promoter of the HAC1 gene itself are for instance the other celulase promoters, cbh£ f egit, 
eg/2, or teff, pgk f pki, the glucoamylase, aipha-amyiass or the alcohol dehydrogenase 
promoter. In filamentous fungi transformation usually results in strains with varying copies of 
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expression vector integrated into the genome (Perittiia eta£, 18S?) and from these the strain 
with optima! level of truncated expression for growth and enhanced secretion can be 
screened. 

It is understood that the methods provided herein may further include cultivating said 
recombinant host cells under conditions permitting expression of said secreted protein. The 
proteins can be collected and purified as desired. In a preferred embodiment, hydrolytic 
enzymes are secreted, in another embodiment, the secreted proteins are used in improved 
alcohol production or in processes where efficient hydrolysis of raw material is desired. 

The following preparations and examples are givers to enable those skilled in the art 
to more clearly understand and practice the present invention. They should not be 
considered as limiting the scope and/or spirit of the invention, but merely as being illustrative 
and representative thereof, 



Example 1 

Effect of expression of truncated HAC1 in yeast 

in order to cause constitutive induction of the unfolded protein response in 
SaCGharomyces cerevisias, a truncated version of the yeast HACi gene was expressed from 
a eenfroroeric plasmid. The truncated version does not include the intron of HACI that in 
norma! conditions prevents the translation of the mRNA. Thus the mRNA expressed from the 
plasmid is translated to HACI protein constitutiveiy and causes a constitutive induction of the 
unfolded protein response. The appropriate HACi gene fragment was first amplified f rom 
yeast chromosomal DNA by PGR, This fragment starts 24 bp before the translation start 
codon of the HACi gene and ends with a translation stop codon inserted after the proline 
codon at amino acid position 220 of the deduced protein. The oligonucleotide primers used 
were: 6 s ATC C3CA G<3A TTC CCA CCT ACS AC A AC A ACC GCG ACT 3 ! (forward primer) 
(SEQ ID No. 20) and 5 ! TAG AGO GGA TCC CTA TGG ATT ACG CCA ATT GTC AAG3 ! 
(reverse primer) (SEQ ID No, 21}. BamHi rest ictlon sites were included info hot? of the 
primers to facilitate cloning. The PCR reaction was carried out with the Vent DNA 
polymerase (New England Bioiabs) in conditions recommended by the manufacturer. The 
PCR program used started with heating id 94° C for three minutes followed by 30 cycles with 
denaturation at 94 e C for 46 seconds, annealing at 5B 0 C for 45 seconds and synthesis at 72° 
C for one minute. The PGR product fragment of 690 bp was run in a 0.8% agarose gel and 
purified from the gel by the Qlaqufck gei extraction kit (Qiagen) according to manufacturer's 
protocol. The fragment was digested with BamBI and cloned into the BamHi site of the 
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pZERO vector (Invifrogen) with methods known In the art. The HAC1 fragment was released 
from pZERO by BamHi digestion and cloned Into the Bglt! site of the vector pKKI between 
the promoter and terminator of the yeast PQK1 gene with methods known in the art. pKK1 
contains the LEU2 selectable marker gene and the centromere (CEN8) and ARS sequences 
for maintenance In yeast as a single-copy piasmkf. The final expression plasmid was named 
PMS108 (figure 1). 

The plasmid pMS109 and the control plasmid pKK1 were transformed Into a yeast 
strain producing Bacillus amyioliquefaemis o-arnylase. In this strain, the expression cassette 
with the a-aroyiase coding region inserted between the yeast ABH1 promoter and terminator 
had been integrated into the TRP1 locus of the yeast strain 08Y746 £cr s hisS&i, l&u2S f waS- 
62, trp1-289, Gytf ). Four pMSI 09 tmnsformants and four strains transformed with the 
vector pKKI war© selected for cultivations. The cultivation medium was synthetic complete 
yeast medium without leucine (SC-Leu, described by Sherman 1991 , Meth. Enzyrool. 194, 3- 
21). buffered to pR 8.0 with 2% succinic acid and supplemented with 2% glucose as the 
carbon source, The 50 ml yeast shake flask cultures were inoculated to the Initial GP8Q0 
(optica! density at the wavelength of 809 hrh} of 0.2 and growth was carried out for five days 
at 30° 0 and 250 RPM. Samples ware taken daily for monitoring yeast growth and ©-amylase 
production . d«aroy!ase activity was measured with the Phadebas Amylase Test f Pharmacia) 
according to the instructions of the manufacturer. Yeast ceil density was determined by 
measuring OO600 (optical density at the wavelength of 800 nm) of the culture. The o~ 
amyiase amounts produced by each of the pMS109 transformants wore higher than the 
amounts produced by any of the pKKI transformants, The average production ieyei of 
pMSlOO transformants was 70% higher in the end of the cultivation than the average of 
pKK1 clones (Figure 2). The growth of the pMS109 strains was slightly retarded when 
compared with the control. 

To analyse the effect, of the constitutive UPR induction to yeast fnvertase production, 
four clones transformed with pfvlSI 00 and four clones transformed with the pKKI vector, 
derived from the a-amylase producing strain descdbed above, were cultivated in the SC-Leu 
medium buffered to pH 6.0 with 2% succinic acid and containing 2% sucrose as the carbon 
source. The 50 mi shake flask cultures were inoculated to the Initial OD600 of 0.2 and grown 
subsequently for five days at 30 % and £80 RPM, Yeast growth was followed by measuring 
the ODS0Q and samples were taken for in vartase assays on days three, four and five. For 
each assays cells were harvested by centrifugatton from 1 ml of the culture. The ceils were 
washed with 5 ml of 10 nM NaN^and rasuspeoded In 0.2 M NaAc buffer, pH5,0 with 10 mU 
HaH s . The Inverfase activity of the cells was measured by Incubating them with 0.166 M 
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sucrose in 0.2 M NaAc buffer. pH 5,0 for 6 minutes- The reaction was stopped by adding one 
volume of 0.5 M KPO4, pH 7,0 and by separating the ceils rapidly from the reaction mixture 
by filtration. The glucose formed into the reaction mixture was measured by the GGD-Perld 
kit (Roehrlnger Mannheim) according to the manufacturer*® protocols. The invertase 
production of the pMS109 traosformante was about 2 times higher than that of the pKKl 
transformants in all the timepoints that were tested (Figure 3), 

Example 2 

Effect of disruption of HA CI in yeast 

The yeast HAC1 gene was disrupted by replacing ft in the genome with a DMA 
fragment containing the 841 8 antibiotic resistance cassette flanked by 48 bp sequences 
from the 5 ! and 3' ends of the HAC1 open reading frame. The G418 resistance cassette 
consists of the £ coti kanamyein resistance gene cloned between the promoter and 
terminator of the Ashbya gmsypH IFF gene encoding translation elongation factor 1 > The 
DMA fragment used in the disruption of the yeast HAC1 was produced by PGR from the 
kenMXa module (Wach ef al., 1994, Y§asl 10, 1793-1808} with the oligonucleotide primers 
5' CCA CCT ACQ ACA ACA ACC GCC ACT ATG GAA ATG ACT GAT TTT GAA CTA CTT 
GCC TCG TCC CCG CCG GGT CAC 3* (forward primer) (SEQ ID No. 22) and S' AAT TAT 
ACC CTC TTG CGA TTG TCT TCA TGA AGT GAT GAA GAA ATC ATT G AC ACT GGA 
TGG CGG CGT TAG TAT CGA 3 s (reverse primer) (SEQ !D No, 23), The PGR reaction was 
done With the Dynazyme DNA polymerase (Finnzymes) in conditions recommended by the 
manufacture. The PGR program started by denaturation at 94*C for 3 minutes, followed by 
30 cycles of denaturation at B4 V C for 45 seconds, annealing at S2*C for 30 seconds and 
elongation at 72X for 1 minute, A Una! elongation step of S minutes was performed at 72*C, 
The PGR product of about 1.5 kh was run in an 0.8% agarose gel and purified from the gel 
with the Qiaquick kit (Giagen). The fragment was transformed into the yeast strain BMA84- 
1 A {a, um3~1, irphA /ei/2-3, 112, hf$3*ib ade2-l s cani-IQG) with a method described 
(Gletz ef at, 1 992, Nuci. Acids Res. 20, 1425), The transformants were first grown over night 
on YPD plates (Sherman, 1 901 , Meflh 1 EnzymoJ- .194. 3-21} and then replicated onto YPD 
plates with 200 pg/mi of the antibiotic G418. The transformants resistant to G418 were 
tested on plates with yeast mineral medium (Verduyn etai, 1992, Yeast 8, 501-517) with and 
without Inositol. Chromosomal DNA. was isolated from strains that were dependent on 
inositol, and Southern hybridization with the HAC1 protein-coding region was performed with 
methods known in the art. The result of the hybridization showed that the HAOI gene had 
been disrupted in the strains dependent on inositol. 
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The effect of the HAC1 disruption on the production of two heterologous proteins, the 
Bacilius amylQliqu&fBci&ns a-amylase (Buohonen et at., 1 987, Gene 59, 1 61 -1 70) and the 
Trfohoderrm «?ese/ endogiueanase EQI (Penttili et at., 1 987, Yeast 3, 1 78-185), was tested. 
The o-amyiase was expressed from a multicopy plasmid with the LBV2 marker gene , B485 
(Ruohonen et al, 1991 , J. Bjoteohooi, n 39, 193-203, the plasmid is called YEpaa6 in this 
article), where the a-amylase gene has been cloned between the yeast ADHi promoter and 
terminator, The EG! was expressed from the plasmid pMP31 1 (Penttila et ai.» 1 987, Yeast 3, 
175-185), where the endogiucanase eDNA has been cloned between the yeast PQK1 
promote? and terminator in a multicopy vector with the LBU2 marker gem. The 8485 and 
pMP31 1 piasrnids were transformed into the HAC1 dismptant and Its parental strain with a 
described method (Qlet2 et al. s 1992, Nuci Acids Res. 20, 1425), and transformants were 
selected on SC-Leu plates (Sherman, 1991. Meth. Envmol. 194, 3-21). Three 8488 
transformer! is derived both from the HAC1 dlsruptant and Its parental strain were grown in 
50 mi shake flask cultures in SC-Leu buffered to pH 6.0 with 2% succinic acid and 
supplemented with 2% glucose. The cultures were inoculated to the initial OD600 of 0,2, and 
growth was continued for four days at 30 °C and 250 RPM, The a-amyiase activity In the 
culture eupernatants was assayed as described in Example 1 . The HAC1 disruptant strain 
produced less than 10% of the o>amyiase amount produced by the wild type control strain 
{Figure 4). To test the effect on £Gi production, three pMPSi 1 fransformants derived from 
the HAC1 disruptant and three transformanfe derived from the parental strain were grown m 
50 ml of SC-Leu (Sherman, 1991, Math, fewmol. 194, 8-21} with 2% glucose in shaker 
flasks. The cultures were inoculated to the initial ODS00 of 0.2 } and grown at 30*0 and 250 
RPM for four clays. Endogiucanase activity of the cultures was measured with the substrate 
4-methyfumbelliferyl-p-D-lactoside (Sigma). Supernatant samples were Incubated at 50°Q for 
3 hours in a reaction mixture of 0.25 mg/mi of the substrate and 0.1 U glucose in 50 mM 
NaAc, pH 5.0. The reaction was stopped by adding two volumes of 1 M Na s CO s . and the 
absorbance of the mixture was measured at the wavelength of 370 nm. The production of 
the endogiucanase EG! of the HA01 disruptam was about 50% of the level produced by the 
parental strain {figure 5). 

Example 3 

Cloning and sequence of the Asp&Fgiffm fiMuiBns ft&cA and Trich&dBrms mm&! HAC1 
genes 

A homology search was performed against a public database 
{htfp://hiQinfQ.ok et^ containing Aspergillus ntfuiam 

EST (expressed sequence tag) sequences with the yeast HAC1 protein sequence using tie 
program BLAST (Altschul et at, 1 990. J. Uol Biol. 215. 403-41 0>. The search Identified one 
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EST cDNA clone (c7a10a1 .ft) which has homology to yeast HACIp at the DNA binding 
domain. However, another region of the same cDNA clone, designated as ESTc7a10a1.f1 
in the database, had no obvious Sfmitehty with HAC1 and there was no annotation within the 
database to indicate similarity between tie ESTs and HAC1 , Therefore, it was unclear if the 
A. nidulans cDNA done encoded a functional nomolog of HAC1 or a different protein having 
a version of a DMA-binding motrf. The region corresponding to the c7a10a1 EST cDNA was 
amplified by PCR from .4, nidttiam genomic DNA isolated with methods known in the art 
The sequences of the ends of the EST cDhIA clone found from the database were used to 
design the the 5' end primer (5 s QCC ATC CTT GGT GAG TGA GCC 3'} (SEQ ID No. 24) 
and 3 1 end phmer ($< CAA TIG CTC GCT CTT AGA TTG AAT 3'} (SEQ ID Ho, 25), The 
PCR reaction was performed as described in Example 2, The PGR product of 1 ,6 kb in 
length was run In an 0.8 % agarose gel, purified from the gel with the Qiaqyick gel extraction 
kit (Giagen) and cloned Into the pGEM -AT vector (Promega) with methods known In the art 
The whole fragment was sequenced from the resulting piasmsd using internal oligonucleotide 
primers, 

To isolate the HAC1 cDNA from Tmhodmm reessi, the proper hybridisation 
temperature for cDNA library screening were determined by genomic Southern hybridization 
with the genomic hacA fragment cloned from A mduiam as a probe. The probe fragment 
was labelled with ag P-dCTP using the Random primed DNA labelling kit (Scahringer 
Mannheim) as instructed by the manufacturer. The hybridization was performed as described 
(Bambrook ©t ai„ 1989, in Molecular Cloning: a Laboratory Manual, Gold Spring Harbor 
laboratory, Gold Spring Harbor, NY) at48°C, S0°C, 55 a C and 60*C in a hybridization mixture 
containing SxSSC, SxDenbardfs, 0.5% SDS, 100 pg/ml herring sperm DNA (BSC is 0.15 M 
NaCi, 0.015 M Na-eifrate, pH 7.0, SQxDenhardfs is 1% Flcoll, 1% polyvinylpyrroitdone, 1 % 
bovine serum albumin). The filters were washed for 10 minutes at room temperature with 
2xSSC, 0.1%SDS and for 30 minutes at the hybridization temperature with the same 
solution. The T, reess'i cDNA library constructed into the vector AZAP (Stratagene, Stalbrand 
etal., 1995, Ao pi. Enviro n, Microbiot 81, 1090-1997} was plated with the appropriate £ call 
host strain, and the A-DNA was lifted onto nitrocellulose filters (Schleicher & Sehul!) as 
Instructed by the manufacturer. Hybridization of the filters was done for 18 hours at S5°C In 
the same hybridization mixture as the Southern hybridization. The filters with A-DNA were 
washed for 1 0 minutes at room temperature with 2xSSC, 0.1 %SDS and for 30 minutes at 
55*0 with the same solution. The A-elones hybridizing with the probe were excised into 
pBluescripi piasmkis containing the cDNA inserts as instructed {Stratagene). The cDNA 
clone carrying the largest Insert {in the plasmid pMSilS, Figure 6) was chosen for 
sequencing, and the whole sequence of its insert was determined with the help of Interna! 
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sequencing primers. The genomio eopy of the X mmelgem was Isolated by hybridization 
of a genomic A~iibrary in the vector AEMBL3 (Kaiser and Munray, 1 985, In PNA Cloning: a 
Practical Approach , pp. 1-47* ed> Qtoyer, !RL Press, Oxford). The library was plated with the 
appropriate E. mil host strain and A-DNA was lifted onto nstroceSlu lose filters (Schleicher & 
Schuil) as instructed by the manufacturer. The litem were hybridized at 42 <l C over night in a 
hybridization mixture containing 50% formamlde, oxDenhardt's, SxSSPE, 0,1% SDS, 100 
pg/mi herring sperm DMA and 1 pg/ml peiyA-DNA {SSPE is 0.18 M NaCI, imfvl EOTA. 10 
mM MaHaPO,;, pH 7.7). The filters were washed for 10 minutes at room temperature with 
2XSSC, 0,1% SOS and 30 minutes at 65°C in 0.1 xSSC, 0.1% SDS. A-DNA was isolated from 
clones hybridizing with the probe with a described method (Samhrook et ah, 1989, in 
Molecular Cloning: a Laboratory. Manual. Cold Spring Harbor Laboratory: Gold Spring 
Harbor, NY), and the genomic region corresponding to the HAC1 cONA was sequenced from 
this DMA with interna! sequencing primers. 

The sequences of the Trlchodemia remei HAC1 and Aspergillus nldulans hacA 
genes are shown in Figures 7 and Si respectively. Comparison of the genomic and cDNA 
sequences from both fungi (the cDNA sequence of hacA available In the EST database) 
reveals a conventional Intron with consensus border sequences at a conserved: position in 
both of the genes. A second intron of 20 bp is found In the T, mesei HAC1 gene. This intron 
does not have the consensus 5* border sequence (ST). The sequence around the 5' end of 
this intron Is. predicted to have a strong tendency to form a RNA secondary structure called 
hairpin loop. The area between the stems of the loop has a sequence very similar to the 
consensus sequence found at both of the Intron borders of the unconventional intron of 252 
bp found In yeast HAC1 (Figure &, Gonzalez et ah, 1 999, EMBQ. J. 18, 31 1 9-3132). When 
ihe yeast UPR pathway is induced, the IRE1 protein cleaves the HAC1 mRNA at these Intron 
borders, and thus initiates the splicing of the intron and formation of an active HACt protein, 
in the Aspergillus niduians hacA gene there is a sequence almost identical to the hairpin- 
unconventional intron region of T. rees&s HACt. 

It has been shown by RT -PGR studies that the 20 bp intron is removed from the T. 
rmsel HACt and A niduians frmA rnRNAs upon UPR Induction (Example 4), The 2S0 bp 
intron in yeast HACt prevents translation of the mRNA probably by forming a specific 
secondary structure (Chapman and Walther, 1998, Gum Biol 7. 8S0-8SS). The 20 bp intron 
in the RACl/fmaA genes of filamentous fungi can not form such secondary structures, and 
thus the activation mechanism of these genes Is different from yeast HACt. The T, m&s&i 
HAC1 cDNA encodes an open reading frame of 451 amino acids and the A. niduians hacA a 
protein of 350 amino acids, when the 20 bp introns have been removed from the both 
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sequences. The putative T. wesefmti A h/cfeMns BACI/A proteins have an identity of 
37,4% with each other and both have a DNA binding domairr conserved with yeast HAC1 
protein (Figure 1 0). The yeast HAG1 binding site has approximately 64% similarity and 53% 
identity to the- binding site of T. mmei and approximately 6S% similarity and 56% identity to 
the binding site of A nidutam. At other regions there is no detectable homology between 
yeast HACIp and the HACi of T< /ease/or the HAGA of A. nidulam. The HAC1 cOMA clone 
sequenced from T, reesel has a 5' flanking region of 471 bp, containing two short open 
reading frames encoding 17 and 2 amino acids. The 5' flanking region sequenced from A. 
w&utem hacA is 187 bp in length, containing, one upstream open reading frame of 7 amino 
acids. 

Example 4 

Demonstration of truncation at the S ! end and splicing of the 20 bp snfron of T, mm&i 
and A. niduSsm HACtihacA mHHA upon UPR induction 

When the UPR pathway is Induced in yeast, the unconventional introrj of the HAOf 
gene is spliced and thus the length of the HACi mRNA is reduced by 250 bp (Cox and 
Walter, 1998, Cell 87, 391-404). it was studied if UPR induction affects the length of the 
HACi/hacA mRNA in T. meseiand A nidulms. The T. reese/ strain RutC-30 {Montaneeourt 
and Evdleigh, 1079, Adv. Cham, Sat, 1 81 , 289-301 ) was grown in a shake flask in a 
TnGtsoderma minimal medium (Penttiia et aL 1987, Gene 61, 155-164} with 2% lactose as 
the carbon source. Growth was performed for 80 hours at 28*0 and 200 RPM, and the 
mycelium was diluted 1:10 into the same medium and grown for additional 21 hours. The 
culture was subsequently divided into two halves, and one half of it was treated with 10 my 
difhSothreitol (OTT) to Induce the UPR pathway (Saioheimo et ai. ( 1999, Mot. Gen. Genet 
282, 35-45). Mycelial samples were collected from the culture treated with DTI and the 
untreated control culture before DTT addition and 30, 80, 90, 120 240 and 380 minutes after 
the addition of DTT. Total RNA was isolated from the samples with the TRIzol reagent 
(Giheo-BRL) according to manufacturer's protocols. RNA samples of 5 pg were treated with 
gSyoxai and run in a 1% agarose gel in 10 mM Na-phospbate buffer, pH 7.0, Capillary blotting 
onto a Hybond-N nylon filter (Amsrsharn) was done as instructed by the manufacturer. The 
fulMength HAC1 cONA that was used as a p«>be was labelled as described in Example 3. 
Hybridization was performed for 18 hours at 42°C in 50% formamide, 10% dextran sulphate, 
1% SDS, 1 M NaCI and 125 pg/nil of herring sperm DMA. The filter was washed in SxSSPE 
for 15 minutes at 42*0; In IxSSPE, 0,1% SDS for 30 minutes at 42*C and in 0.1x8SPE s 
0,1% SDS for 30 minutes at room temperature. The results (Figure 11) show that the length 
of the HACI mRNA does not change in the control samples not treated with DTT'. In the 
samples treated with DTT a shorter mRNA of about 2,2 icb appears in addition to the 2,5 kb 



wo nmim 



- 3U» 



mRNA obssrved in the control samples. The fulMength H4G? cDNA probe was removed 
from th© Northern filter by incubating it in 0.1% 3DS at t0O°0. for 1 0 minutes. The filter was 
then hybridized with a probe containing a 160 bp sequence from the 5 s flanking region of the 
HAC1 gene. This probe was made by PGR from the piasmfd pM&t 1 9 (Figure 8) with the T3 
primer (5' AAT TAA GGG TGA CTA AAG GG 3'} (S1Q ID No. 28) binding to the pBlueseript 
vector as the forward PGR primer and the oligonucleotide 5 ! TOG TI G ATG ACS ACG 
ATGCGA ACA GTC ATG ACA GGG AAC G 3' (SEQ ID No. 2?) as the reverse primer. The 
PGR reaction was performed as described In Example 2. The probe preparation was done 
as in Example 3. The Northern hybridisation with the shod fragment was done as described 
above for the full-length HAC1 cDNA probe. The short probe fragment derived from the 5' 
flanking region of the HAC1 cDNA hybridized with the full-length HAC1 mRNA of 2.5 kb but 
not with the 2,2 kb mRNA that appears when UPR is Induced by DTT, Indicating that the S ! 
end is the segment absent in the 2.2 kb mRNA. If has previously been shown that the 7'. 
m&sefpdil gene Is controlled by the UPR (Saioheimo et ai., 1999, MoL Gen Qenet 282, 35- 
45). To show that the UPR is induced in this experiment with DTT, the filter was probed with 
the pdll and gpdl probes. The pdfl mRNA becomes more abundant In the mycelium treated 
with DTT, whereas the gptil mRNA remains at an almost constant level. 

To analyse more closely the change that occurs in the T. reesei HAC1 mRNA Upon 
UPR. pathway induction, the mRNA populations In Induced and uninduced conditions were 
studied by rapid amplification of cDNA ends by PGR (RACE-PCR). PolyA+ RNA was isolated 
from total RNA samples derived from a DTT-treated and an untreated control myceiia, using 
the OligoTex mRNA isolation kit (Qiagen) as instructed by the manufacturer. The Marathon 
cDNA amplification kit (Ciontech) was used in the RACE-PGR procedure according to 
manufacturer's protocols. The fi4C?-speei!lc oligonucleotide used In the reaction was 3' 
GGG AG A GGA CTG GTG GAA CGC GAT 3 s (SEQ 10 No, 28). it binds 500 bp downstream 
from the 5' end of the fulHength HAC1 cDNA. The isolated mRNA was used in synthesis of 
double-stranded cDNA, An oligonucleotide adapter was ligated to the ends of the cDNA, 
and the 5* ends of the HAC1 cDNAs in each sample were amplified by PGR with the HACh 
specific primer and a primer supplied In the kit that binds to the ligated adapter. The PGR 
program consisted of 5 cycles with denaturation at 94°C for 5 seconds and synthesis at 72*C 
for 3 minutes followed by 5 cycles with denaturation at 94°C for 5 seconds and synthesis at 
7G*C for 3 minutes and 25 cycles with denaturation at 94 !> C for 6 seconds and synthesis at 
68°C for 3 minutes. The PGR products were analysed in a 1% agarose gel. A fragment of 
the expected size (about 650 bp. Including the 5' flanking region of the HAC1 gene and the 
adapter ligated to the end), corresponding to the S.S Kb mRNA. was obtained from the 
control sample derived from mycelia not treated with DTT. A second fragment of about 250 
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bp, corresponding to the 2.2 kb mRNA size, was obtained in the PCR from the sample 
treated with DTT in addition to the one observed in the control sample. The 5S0 bp fragment 
of the control sample and the 250 fop fragment from the DTT-treated sample were isolated 
from the agarose gel with the Giaquiek gel extraction kit (Qiagen) as instructed by the 
manufacturer, and cloned into the pCR2.1*TGPG vector with the TGPO TA cloning kit 
(fnvitrogen) as Instructed by the manufacturer. Two Independent clones derived from the 
control RNA and carrying the 550 bp insert were sequenced. They had their 5' end 8 bp and 
16 bp downstream from the 5' end of the folHengfh ePNA (nucleotides 8 and 16 In the 
sequence in Figure 7} and the sequence continued until the end of the H4C?-speei!ie primer 
as in Figure 7. Seven independent clones derived from the DTT-treated mycelium and 
carrying 250 bp inserts were sequenced. The 5' ends of these fragments were each at 
different positions between nucleotides 254 and 338 in the sequence in Figure 7, and in 
each case the sequence continued until! the end of the H4C?-speeifte primer as m Figure 7. 
This further confirms that the S' end of the T. reesQiHACI mRNA is absent when the UPR 
pathway is Induced by DTT, The upstream open reading frame (uORF) of 17 amino acids is 
in the region that is left out from the mRNA* Thus this uORF could be involved in the 
regulation, preventing translation initiation at the correct start codon and formation of the 
HACI protein. 

The splicing of the 20 bp intron from the T, me$elHAC1 mRNA upon UPR induction 
was studied by reverse transcriptase-PCR (RT-PCR), The mRNA samples used in RACE- 
PCR (previous paragraph), one treated with 1 0 mM DTT and the other not. treated, were 
subjected to first strand cDNA synthesis with the Riboclone cDNA synthesis system 
(Promega) according to manufacturer's instructions. A fragment of about 500 bp in length, 
covering the region with the 20 bp intron id the HACI gene, was amplified by PGR from the 
synthesized cDNA using the forward primer 5' CCC GGA GCA GTG CTT GAT GQ 3' (SEQ 
ID No. 29} and the reverse primer 5 ! GTC GTT GAT GTG GAA GT 3' (SEG ID Mo, 30). The 
PGR program consisted of denaturation at 94*C for 2 minutes followed by 30 cycles with 
denaturafion at 94 *C for 45 seconds, annealing at S0°C for 30 seconds and synthesis at 
72 <: G for 1 minute. A final synthesis step of 5 minutes at72*C was performed, The DNA 
fragments obtained In the PGR were cloned Into the pGRS.I vector with the TOPO TA 
cloning kit (Invstrogen) as instructed by tlie manufacturer. Ten fragments derived from doth 
the DTT-treated sample and the nontreated control sample were sequenced. Nine out of the 
ten fragments from control sample had the Intron unspitced. Only two out of the ten 
fragments from the DTT-treated sample had the shrron unspiieed, showing that splicing of the 
intron occurs upon DPR induction by DTT. 
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To examine whether the S* flanking region and the 20 bp intron are removed from the 
Aspergillus nidulam haaA mRNA upon UPR induction simfiady to the T< meseiHACI 
mRNA, Northern hybfidteafiGn mid RT-POR ^erimente were carried out The .4. nitiulam 
strain FGSC A28 was grown for three days in shake flasks in a medium containing 3% 
glucose, 2.5% com steep liquor, IS g/i KH fi P0 4( 5 g^ {NH^gSO^ 5 mg/i FeSO, 1 .8 mg/1 
MnSQ 4 , 1 A mg/f ZnSO,?, 3.7 mg/l CoG! 2i pH 6.8, The culture was divided into two aliquois, 
and one aliquot was treated with 20 mM DTT and the other served as a control Samples 
were withdrawn from both aliquot® aiO : 30. 80. 120 and'240 minutes after the DTT addition. 
The mycelium was washed with 0.9 % NaCLarid stored frozen at ~70*C. Total RNA was 
isolated from the myoelia with the Trizof reagent (Gibeo-BRL) as instructed by the 
manufacturer. Agarose gel electrophoresis, Northern blotting and hybridization of the RNA 
samples was performed as described in the first paragraph of this example. The Northern 
was first probed with the full-length hacA genomic fragment shown in Figure 8. The probe 
hybridizes with a single 1.7 kb mRNA band In samples not treated with DTT. In the samples 
treated with DTT for 120 and 240 minutes., an additional band of about 1 ,55 kb Is detected, 
showing that the hacA mRNA is truncated upon UPR induction (Figure 12). The Northern 
was then probed with a short probe derived from the 5 ! end of the hacA gene. The probe 
fragment was made by PCR from the pGEM-AT vector carrying the hacA gene (Example 3) 
with the T7 primer (5 ! QTA ATA CQA CTC ACT ATA GGG C 3') (SEQ ID No. 31) as the 
forward primer and ftaeA-specifio oligonucleotide 5' TTA GGA CAG AGO CCA GGG TGT 3' 
{SEQ ID No. 32) as the reverse primer. The PCR reaction was performed as described In the 
previous paragraph. The 5' end probe has the first SO bp of the sequence in Figure & The 
shod B' end probe hybridizes only with the 1 ,7 kb mRNA, showing that the hacA mRNA Is 
truncated from the 5' end when the UPR pathway Is induced. 

To test if the 20 bp intron Is removed from the A. nklul&ns hacA gene when UPR is 
induced by DTT RT-PCR was performed. The total RNA samples isolated from myeelia 
treated with 20 mU DTT for 240 minutes and from control mycella were subjected to RT- 
PCR reactions with the Robust RT-PCR kit (Ftnnzymes, Finland) as instructed by the 
manufacturer, using the forward primer 5' GCC ATC CTT GQT GAG TGA GCC 3 ! (SEO ID 
No, 33) and the reverse primer S ! AAG AGT CGQ TGT CAG AGT TGG 3' (SEQ ID No. 34). 
The DNA fragment of about 400 bp obtained in the PCR was cloned Into the pCR2.1-TOPO 
vector with the TOPO TA cloning kit (invitrogen) as instructed by the manufacturer. Twelve of 
the cloned fragments derived from DTT4reaf©d and ten from control myeelia were 
sequenced. None of the fragments dedved '.from "the control myoeiia had the intron spliced. 
Three of the fragments derived from the DTT-treated myeelia had the Intron spliced, 
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Example S 

Complementation of yeast HAC1 and disruptions by different forms of the T. 
mese/ H&C? cBWA 

The Si mmvisias IRGi gene was disrupted in the same way as the HAGi gene 
(described in Example 2), A fragment where: a (3418 resistance cassette is flanked by 
sequences from the 5 1 and 3' ends of the iREI^m reading frame was mads by PGR. The 
forward primer 5' ATT AAT ATT TTA GCA CTT TGA AAA ATG CGT CTA GIT GGA AGA 
AAC ATG CTT GCG TCG TCC CCG CCG 6GT CAG 3' (SEC ID No. 35} and the reverse 
primer 5 ! AAG CAG AGG GGC ATG AAC ATG TTA TGA ATA CAA AAA TTC ACG TAA 
AAT GTC GAC ACT GGA TGG CGG CGT TAG TAT 3' (SEG ID No. 38} were used in the 
PGR reaction. The PGR reaction, yeast transformation, and selection and analysis of the 
dlsruptants were performed as described In Example 2 for HAC1 disruption. 

To express different forms of the T. re&sm HAGi gene in the yeast HAGi and ME1 
disrupt&nts, four expression constructs were made into the multicopy expression vector 
pAJ40l (Salohalmo etal., 1994. bloi. Microbiol. 13, 11-21) with the UHA3 marker gene and 
yeast PGKf promoter and terminator to drive the expression. One of them has the HAC1 
cDNA with the intact 5' flanking region and does not have the 20 bp intron. This plasmid, 
pMS1 31 (Figure 13), was made by releasing the HAGIcDUA from pMStIS, which is the 
pBiuescript vector (Stratagene) carrying the fuiMength cDNA, with EcoRI and AspTI 8 
digestion, filling in the ends of the fragment with Kienow polymerase and Sigating St to the 
EcoRi restriction site of pAJ401 with methods known in the art. The second construct has 
the Z'.mes&i HAGi cDNA truncated at the 5' end but does not have the 20 bp intron. The 
truncated HAG1 oDNA fragment was made by RGB from the plasmid pMS1 1 9 (Figure 6} with 
the forward primer 5' CCG GAA CAG GAC ACG GCA GGC AAC 3" (SEQ ID No. 37) and 
reverse primer 5' CTA GGT AGA CGT TGT ATT TTG 3' (SEQ ID No. 38). The PGR reaction 
was carried out as described in Example 2, The PCR product was run in a 0.8% agarose gel 
and purified from it with the Qlaqulek gel extraction kit (Giagen). The fragment was cloned 
Into the EcoRV restriction site of the pZERG vector using the Zero Background Cloning kit 
(Invitrogen) according to manufacturer's protocols. The fragment was released from this 
vector with BamMI digestion and ciOned between the EcoRi and Xhoi restriction sites of the 
pAJ40 1 vector with methods known in the art, The resulting plasmid was named pMSl32 
(Figure 14). The third and fourth egression piasmids have the 20 bp intron added to the 
HAGi cDNA forms either with or without the 5' flanking region, These piasmids were 
constructed .by replacing a Hpai-KspJ fragment of about 800 bp in p'MS131 and py8132 with 
a corresponding Hpal-Kspi fragment from a cDNA which has the 20 bp intron, isolated from 
the cDNA library in AZAP together with the cDNA in the plasmid pMS1 1 9 {Example 3). 
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To test for complementation., the four expression piasmids and the vector pAJ401 
alone were transformed into the yeast HACI and disruptanfs as described {Gietz el a!. ; 
1 992, Nyot Acids Res, 20, 1425), Four transformarffs f rom each of the transformations were 
streaked on SO~Ura plates (Sherman, 1991, Metis, Enzyme*. 194, 3-21} and grown at 30 C C 
for throe days< The plates were then replicated onto mineral medium plates (Verduyn e! ai., 
1992, Yeast S, 501 -517} with inositol and on plates without inositol These plates were 
incubated at 30 "C for three days and the streaks growing on them were replicated again 
onto the same plates. After growth of five days the inositol dependence of the transformants 
was evaluated (Figure 15). Both pMS131 (HAC1 cDNA with 5* flanking: region and without 
intron) and pUSi 32 (without 5' flanking region, without imron) could restore the ability of both 
the HAC1 and SRB1 disruptants to grow without inositol. Thus the T, rms&i HACI encodes 
the functional homoiog of the yeast HAC1 gene, When the 20 bp intron is added to pfv1S131 , 
no complementation is obtained. When the intron is added to pMSl32, the yeast disruptants 
grow very slowly without Inositol, Thus the 20 top Intron weakens the ability of the T, mm&i 
HACIgem to complement the yeast HAG1 and ,'REl disruptions. 

Example 8 

Binding of the T, rmsei HACI protein to UPB elements of the pa* 1 and itipi promoters 

A fragment of the T. reeae/HACI protein containing the putative DNA binding domain 
and leucine zipper region was produced in £. cofi as a fusion protein with the E, mil maltose- 
binding protein malE. A DNA fragment encoding this part of the HACI protein was prepared 
by PGR from the HACI cDNA with the oligonucleotide primers 5' TCG AAC GGA TCC GAA 
AAG AAG CCC GTC AAG AAG AGG 3' (forward primer) (SEQ ID No. 39} and 5' ATC GCA 
GGA TCC CTA GOT TIG GCC ATC CCG CGA GCC AAA 3' (reverse primer) (BEG ID No. 
40). The PGR reaction was performed as in Example 2. The PGR product of 380 bp was run 
in an 0.8% agarose gel and purified from the gei with the Qlaqulek gel extraction kit 
(Qiagen). The f ragment was digested with BamHf at the restriction sites included in the PGR 
primers and cloned into the BamHf restriction site of toe vector pMAt-p2X (New England 
Bioiabs) with methods known in toe art The HAGi-roaiE protein was produced in £ co# and 
purified by amylose affinity chromatography using the pNAL Protein Fusion and Purification 
System (New England Bioiabs) as recommended fey the manufacturer. The £ cott cells were 
grown up to OD800 0,5 at 37°C, IPTG was added to the concentration of 0,3 mh% and 
production was earned out for 8 hour at 24*C, The HAGI-maE fusion protein with the 
expected appa rent molecular weight was pUrifjed> 
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The oligonucleotides used In binding reactions were annealed in the concentration of 
100 mg/ml in 50 mM Trie, pH 8.0 S 10 mMlMgCi S( 1 mM spermidine and SmM DTT by heating 
them at BS*C for 10 minutes and Setting them cool down to room temperature during 2 hours. 
The oligonucleotides were iabelled Py Incubafeig 100 ng of the annealed oligonucleotide in 
1 0 mM Tris, pH 8.0, 5 mM MgCfe with 20 uCI of '^P-dCTP and 2.5 U Ktenow polymerase 
(Boehringer Mannheim) at 37*C for 30 minutes. The binding reactions between the 
Oligonucleotides having the putative UPB elements and the proteins were carried out with 
0.5 ■• 2 jjg of the HAChroalK fusion protein or 2 pg of the malE protein and 1 ng of the 
annealed and labelled oligonucleotide in a mixture containing 20 mM HEPES, pH 8,9, 50 
mM KCi, 10 mM MgCI 2 , 0.25 mM EDTA..0.S mM DTT, 2% FicoM, 5% glycerol and 100 pg/mi 
poly(didC} DNA. The competing oligonucleotides were used in 20-200 times excess of the 
labelled oligonucleotide. The binding section mixtures were incubated for 30 minutes at 25°C 
and run in a 5% polyacrviamide gel with 10% glycerol »n 12.5 mM Tris-bos-afa pH 8,3, 0.6 
mM EOT A for three hours. The gel was dried on a filter paper and exposed onto an X-ray 
film. 

The following oligonucleotides carrying the putative UPR elements of the pcSfi and 
bipi promoters were used In the binding reactions (only the leading strand is given, the 
UPBEs are given in bold); 

pdiUPREi+ii, containing both of the putative UPR elements of the pdil promoter (Saiehelroo 
etal, 1.999, Moi, 8em Genet 262, 35-45}* 

5' CGG CTG AAC GAG CGC GGC AGO GAG ATG TGG CCA AAG GG 3 1 (SEQ ID No. 41 } 

pdiUPREi, containing the UPREI of the pdil promoter in a random context 
5' GGT ACC TGG TAA CCA GCG CGG CAT GAT TO A AC 3' (SEQ ID No. 42) 

pdiUPREii, containing the UPREii of the pdil promoter in a random context 

5' GGA TCT TGG ATA GCC AGATGT GGG OTC GAT TQA CT 3' (SEQ ID No. 43) 

bipUPREi, containing the UPREI of the frip f promoter (unpublished results) 
5" GGA TTA GAA AAG GCC AAC GTQ TOG ATA ACG GTC 3' (SEQ ID No. 44)' 

bjpUPREfl, containing the UPREII of the bipi promoter, the element .Is in a reverse 
orientation in the promoter {unpublished results) 

5' GGG CGT GGA GAA GCG AGA AGT GGG OTC TTG TTC TOG 3' (SEQ ID No, 45) 
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The results {Figure 16) show that te HAGS-maiE fusion protein binds to the putative 
UPR element area found torn the pd/lprome-ter whereas the maE protein alone does not 
show any binding. The binding of the fu£to.0rO't^n^':^3e(^c,.$lnce it Is competed by an 
excess of unlabeled oligonucleotide. The f usion protein binds specifically also to the 
oligonucleotide pdiUPREIl and not at ail fo pdtUPRB, and this Indicates that the functional 
UPR element of the pdil promoter is UPREil. The MACI-maiE fusion protein also binds 
specifically to both of the putative UPR elements found in the Mpi promoter. Alignment of 
the three 7. rms&iVPR element shows that the consensus sequence for binding is 
G C ( C/6) A (<3/A) N t <>GTG(G/T)C (Figure 18} (SEQ ID No. 48), 

Example 7 

Expression m yeast of the Triehod&mm HAC1 eDNA without its 20bp fntron and 
truncated at the 5' end 

The 7. immi HAC-1 cDNA was expressed without its 5' flanking region and without 
fie 20 bp intron from the piasmid pM$132 (Figure 14}, This plasmiri and the control plasrnid 
pAJ40T were transformed with a described method (Gietz et al„ 1992, Nucleic Acids Res, 
20, 1 425) Into the yeast strain producing. BaciUus amyloiiquefadens ©--amylase described In 
Example 1,, Two strains carrying pWS132arid two strains with pAJ401 were grown for six 
days in shake flasks (250 PPM, 30*0) In SC-Ura medium (Sherman, 1 991 , Meth. Enzymoi 
1 94, 3-21 ) buffered to pH 8.0 with 2% succinic acid and growth and amylase production 
were assayed as described In Example 1, Cell samples were withdrawn from the culture tor 
Northern analysis. The o-amylase production of the pMSI 32 transformants calculated per 
bfomass was higher than that of the pAJ401 transformanis from day 3 until the end of the 
cultivation (Figure 1 7), Growth of the pMS1 32 strains was slower than the growth of the 
control piasmid strains. Four pMS132 fransforrnants and four pAJ401 transformanis were 
grown in shake flasks (250 RPM, 30 C C) in SG-Ura with 2% sucrose as the carbon source, 
and (nvettase activity produced by the cells was assayed as described in Example 1 . yore 
invertase was produced by the pMS132 transfonnants than by the pAJ401 transformants 
{Figure 18), 

To show that the truncated T.rem&i HACIcDHA is beneficial for a-amylase and 
invertase production by inducing the UPR pathway of yeast, Northern analysis was 
performed on the oeli samples withdrawn from the cultures of pMS132 and pAJ401 
transforn-fants. Total RNA was isolated from the ceils collected after 1 , 2 and 3 days of 
growth with the RNeasy RNA extraction kit (Qlagen) as instructed by the manufacturer, The 
yeast KAR£ gene is under the UPR pathway eohtrof (Cox and Walter, 1996, CelJ87, 391- 
404), and therefore the Northern filter was probed with a fragment derived from KAR8, This 
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fragment was produced by PCR from yeast chromosomal DMA with the oligonucleotide 
primers 5' GT6 8TA ATA TTA CCT TTA CAG 3' (SEQ ID No. 4?) (forward primer) and 5 1 
CAA TTT CAA TAG 0GG TGG AG 3* (reverse primer) {SEQ ID Mo. 48). A fragment from the 
yeast TDH1 gene encoding ^yceraidehyde phosphate dehydrogenase was used as a 
control probe, since this gene is expressed cohstifytively and is not expected to be affected 
by UPR. The TDH1 probe fragment was made from yeast chromosomal DMA by PCR with 
the oligonucleotide primers 5 ! TGT GAT CAC T<3C TCG ATC IT 3' (forward primer) (SEQ ID 
Ho. 49) and S ! TTA AGO CTT GGG AAC ATA TT 3 f (reverse primer) (SEQ IP No. 50); The 
PGR reaction was done as in Example 2 and the probes were prepared as described in 
Example 3. Northern blotting and hybridization were performed from the RNA samples as 
described in Example 4, The filter was exposed to the screen of the phosphoirnager S! 
(Molecular Dynamics}., and the signal intensities were quantified with the phosphoimager. 
The KAR2 signal intensities were normalized with reference to the TDH1 signal intensities. 
The results (Figure 19) show that the KAR2 mHN A abundance is 2-4-foid higher, in the 
pMS1.32 trahsforhlants than In the pAJ401 transformants In all the tlmepoints. 

Exampte 8 

Expression in Trtehotf&rma mmeioi the HAC1 gene without its 20 bp inimn and 
truncated at the S* end 

To induce the UPR pathway constiiutsvely. a form of the T, fmssi HAG1 cDNA that is 
truncated at its 5' flanking region and does not have the 20 bp intron was expressed in f, 
m&sssh The form of the HAC1 cDNA that was present in pK4S132 was expressed In yeast as 
described in Example 5 was cloned with methods known in the ah info the Ncoi restriction 
site of the vector pAN52-Not!, between the gpdA promoter and trpC terminator of Aspergillus 
nlduiam. The hygromycin resistance cassette consisting of the A, nfduians gpdA promoter 
and trpC terminator and the £ call hygromyein resistance gene was subsequently cloned 
Into the Not? restriction site of the pAN52-Noti carrying the HAC1 cDNA fragment. The 
resulting piasmid s named pMS136 (Figure 20). was transformed into 71 mmef strain 
P37PACBHlpTEX-CHY22 as described (PentfM et a!. f 1987, Gene 81, 165-184). Strain 
P37PAC8HipTEX*CHY22 was constructed by transformation of strain P37PACBHiPyr-26 
{U.S. Patent No. 5,874,278) with a version of the expression vector of pTEX-GHY. Vector 
pTEX-CHY is a derivative of pTEXtn which the coding region for the 71 rmsei 
ceiiobiohydroiase I (CBHi) signal sequence, catalytic core and linker region (amino acids 1 - 
478 of CBHi, Shoemaker, et ai. ; 1983, Bid/Technology:, 1:891-898) fused to the coding 
region of bovine procbymosin 8 {Harris etal, Nucleic Acids Research, 10:2177-2187, was 
inserted between the cbhl promoter and terminator region by methods known in the ah. 
Selection of the P37PACBH!pTEX-GHY22 transformants with pfvfSI 38 was performed on 
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media with 1 00 pg/rol hyo/omyclh. To obtain uninuclear transformer* clones, the 
transformants were sporuiated and single spores were plated on the selective medium with 
hygroroycfn. Purified transformants and the parental strain used in the transformation were 
grown in shake flasks (28*G, 200 RPM) in Tnchodenm minimal medium (PenttM at aL. 
1987, Gene 61 , 1 55-164) supplemented with 3% whey and 0.2% peptone. Mycelial samples 
were collected from the cultures on the third, f iff vend sixth cultivation days. Total RNA was 
Isolated from the myeeiia with the TRi2:o! reagent (Qibco-BRL) as instructed by the 
manufacturer. Northern blotting and hybridization were performed to the RNA samples as 
described in Example 4. The Northern filter was first probed with the full-length HACI cDNA, 
and an mRNA derived from the expression construct which is about 2.0 kb In length can be 
observed in two of the transformants in addition to the 2,5 kb band that Is derived from the 
native HACI gene (Figure 21 ). The HAC1 probe was removed from the Northern filter by 
incubating it in 0,1% SDS at 10G°G for 10 minutes. The filter was subsequently probed with 
the f. rmsmpdil, bipl and gpdl probes. Pdrt encodes the protein disuiphido isomerase 
and has been shown to be regulated by the UPR pathway {Saloheimo eta}., 1999, Moi, Gen. 
Genet. 262, 35*45), Bip l (unpublished) encodes the T. reesei homoiogue of the ER-specific 
chaperone protein Blp. The gpdl gene encodes gives raldehyde phosphate dehydrogenase 
and was used as the constitutive control probe. After hybridization the filter was exposed to 
the screen of the Phospboimager SI {Molecular Dynamics) and the signals were quanified 
with the phospholmager. The pdH and bipl signals were normalized with, respect to the gpdl 
signals. The results show that in the two transformants which express the truncated HACI 
mRNA the pdil mRNA level is 4- and 7-foid higher than in the parental strain on the third 
culture day (Figure 21). This Indicates that the UPR pathway can be induced eonstitufiveiy in 
Trichoderma reesei by the expression of HACi gene without Its 20 bp sntron and 5 ! flanking 
region 

Example § 

The effect of a T. reesei HAC1 mutation on heterologous protein production 

A Trichoderma reesei strain where the HAC1 gene is mutated was unexpectedly 
generated during the transformation of the piasmid pMS136 into the strain producing C8BI- 
chymosin fusion protein {Example 7). When analysing the transformants by Northern 
hybridization ft was noticed that one OHhe transformants (number 31} produced several 
forms of the HACI mRNA that are considerably shorter than 2 kb (Figure 21 ianes 4, 8 and 
12). On the fifth and sixth day ol the culture as described in Example 7 the unfolded protein 
response Is Induced in the parental strain of the transformation, presumably by the 
production of the heterologous protein dhymosin. This is seen in the Northern analysis as 
appearance of a HACI mRNA of about 2,2 kb (truncated at the 5' flanking region) and as the 



wo nmim 



- 39 - 



Induction of the ptfil mRNA on days S and 8 (Figure 21 }. It has previously been shown that 
the production of antibody Fab fragments' induces the pdif gone (Safohetmo ef al. s 1 999, 
fylol. (3en, Genet. 282, 36-45), in the tmoslofmant nitmber 31 the 2.2 kb HAC1 mRNA and 
the Induction of the pd/iand btplmWNM are not detested, suggesting that the HACt gen© 
of this stain is functionally impaired. To further verify this, a DTT treatment experiment of the 
transformant number 31 v/as carried out It was grown in shake flasks (28°C« 200 RPM) in 
the Trichocienrm minima! medium (PenttM at al., 1987, Gene 81 , 155-164) with 3% whey 
and 0,2% peptone for three days. The culture was divided into two aliquots and one- ot them 
was treated with 1 0 mM dithiothreitol {DTT) and the other sewed as the control. Samples 
were taken from both aliquots at 0, 80, SO, 120 and 240 minutes after DTT addition. Total 
RNA was isolated from the myeslia and Northern hybridization was performed as described 
in Example 7. Hybridization of the Northern with the HAC1 probe reveals that the UPR 
induction by DTT is severely delayed in the transf ormant number 31 . The HAG1 mRNA of 
2.2 kb is detected only 4 hours after DTT addition (Figure 22} and a 2-fold induction of the 
psSt gene Is also apparent in this tirnepoint in a wild type strain the 2.2 kb HAC1 mRNA 
appears and the ptiil induction fakes place after 30 minutes of DTT treatment (Example 4, 
Figure 11). 

The chymosin levels produced by the control strain and the transformant number 31 
were measured daily from the media of the whey-peptone cultures described in example 7. 
The measurements were done from two parallel cultures with a milk clotting assay (Cuhn- 
Coleman, etal, 1 991 , Sio/Technology, 9:076-981. Transformanf number 31 produced 
roughly the same amount of chymosin as the parental strain on days 2 and 3 of the culture, 
On the later days the chymosin levels in the culture of the mutant strain started declining, 
whereas the control strain couid still increase significantly the chymosin amount in its culture 
medium (Figure 23). The difference between the two strains is evident in the late stages of 
the culture, where the UPR pathway is induced in the parental strain but not in the strain 
number 31 . This suggests that a imo^m&lHAC'l gene and Induction of the UPR pathway in 
the late culture stages is needed for efficient production of CBHI-chymoein fusion protein in 
7, reeseL 

Example 10 

Cloning and sequences of the Aspergitim nldislam ptcB and Trichod&rma reeeei pi$2 
genes 

The yeast protein phosphatase encoded by the P7G2gene has been shown to be 
involved in the regulation of the UPR pathway (Welifanda at ai., 1998, Moi, OelLRiol, 18, 
1 §07-1977). The IRF1 protein Is phosphoryiaiad when the UPR pathway Is turned on 



wo nmim 



» 40 - 



(Shamu and Walter, 1998, EMBO J. 15:3828-3039}, and Pte2 dephospherviafes IRElp and 
regulates the UPR negatively. A BLAST search (Alisehul et at, 1990, J. MoL Blot. 215. 403- 
410) was made with the yeast Pte2 sequence against "the public database containing 
Aspergillus nkiulans EST cDNA sequences, and the cDNA clone S2e04ai was found to fee 
homologous to it within the database. The region corresponding to this cDNA was amplified 
by PCR from Aspergillus nldulans genomic DM with the oflgonucleofides $' TTQ AAC AGC 
AQA TCG TTA CTQ 3' (forward primer) (SEQ ID No. 51) and 5' TAT AAA GTT COT GAA 
TAG TGG 8' (reverse primer) (SEQ 3D No, 52). The PCR reaction was carried out as 
described in Example 2. The resulting PGR fragment was cloned Into the pCR2.1 vector 
with the TOPO TA cloning kit (Invitrogen) as instructed by the manufacturer, it was 
sequenced with Interna! oligonucleotide primers (Figure 24). The optimal hybridization 
conditions for isolation of the T, reesei picS cDH A were determined by Southern 
hybridization of 7. mesm genomic ONA with the A, nldulans ptcB fragment as described in 
Example 3, A T. r&esei cDUA library constructed in A2AP (Stratagene, StaSbrandt et al., 
1 995. APfil EnvirQ^ , ^cm^ p( f 81,1 090- 1 097} was screened by hybridization with the A, 
nldulans ptcB fragment as descr ibed in Example 3. The A-clones hybridizing with the probe 
were excised into pBlusscript piasmitte with the cDNA inserts as instructed (Str&tagene), and 
the clone having the longest insert based on restriction enzyme digestion was chosen for 
sequencing. The Insert of this cDNA .clone is 1 830 bp in length, encoding an open reading 
frame of 438 amino acids (Figure 25). The putative Trichoderrm PTCIi protein (used 
interchangeably with PTC2) shows the highest Identity among yeast proteins to Pte2, 48%. it 
also shares 80% identity with the putative PTC2 protein from Schlzosaccharomyces pombe. 
The ptcB fragment cloned from Aspergillus nldulans is 1 284 in length (Figure 24). Based on 
homology with other Ptc2 sequences, an intron has been identified in the fragment The 
deduced amino acid sequence is 89% identical to T. re&s&i PTCII protein over an area of 
1 1 7 amino acids. 

Example 11 

Cloning and sequences of the A&pergtilm nMulans IreA and Trmh&dmma rms&l IRE1 
genes 

A search with the program BLAST {Aifsbhul et al, 1990, J, tool Biol 2 1 S. 408-410) 
was made with the yeast IRE1 protein sequences against me public database containing 
Aspergillus nldulans EST cDNA sequences. The EST clone v1 hOlal was homologous to 
yeast IRE1 protein and include such annotation. The region corresponding to this EST cDNA 
was amplified by PCR from Aspergillus nldulamgmwtmc ONA with the oligonucleotides 5* 
CGG AGG CAA GAG TCA TAG ACS 3' (forward primer) {SEQ ID No. 53) and 5' CAA TAT 
ATT TCT GAA CCA GTA CG 3" {reverse primer) (SEQ ID No. 54). The PCR reaction was 
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carried out as described In Example 2, The resulting PGR fragment was cloned into the 
pGRS.I vector with the TOPG TA cloning kit (fnvltrogen)as instructed by the manufacturer, 
31 was sequenced with Interna! oligomicieotide pnmers. The fragment was used as a probe in 
isolation of the 7. reeseiLREI gene. Optimal hybridization conditions were first determined 
with Southern hybridization of genomic- 7. memf&HA as described In Example 3. A ?* r&esei 
genomic library constructed in AEM8L3 (Kaiser and Murray, 1985 : in DNA Cloning; a 
Practical Approach . pp. 1-47, ed. <3lover» iRL Press, Oxford) was then plated with the 
appropriate E cotihast strain and A-DNA was lifted onto nitrocellulose filters (Schleicher & 
Sehirii) as instructed by the manufacturer. The filters ware hybridized over night at 50°C in a 
mix containing SxSSC, SxDenbardfs, 0.5% SDS, 100 pg/mi herring sperm DNA (SSG Is 0.15 
M NaCI, 0.015 M Na. citrate, pH 7,0, SOxDenhardt's is 1% Rcoli ; 1% polyvinylpyrrolidone, 1% 
bovine serum albumin). The fitters' were washed for 10 minutes at room temperature with 
2xSSC, 0,1%SDS and for 30 minutes at 50*C with the same solution. A-DNA was isolated 
from clones hybridizing with the probe with a described method (Sambrook eta!,, 1980), 

Most of the protein-coding region of the genomic ff?El gene was subcloned Into 
pBiuesoript SIC as 2,1 kh and 2.4 kh BamHI fragments with methods known In the art. These 
fragments were sequenced with synthetic oligonucleotide primers, The two subclone 
fragments do not cover the whole open reading frame, and thus the 5' end of the 
chromosomal gene was sequenced from DNA isolated from the A-clone isolated from the 
genomic library. An IRE1 cDNA was isolated from a 7, reemi library constructed in AZAP 
(Stratagene), The cDNA library was plated with the appropriate £ coil host and lifted onto 
■nitrocellulose filters (Schleicher & Sehuil) as instructed by the manufacturer. The probe 
fragment used in the screening was obtained by digesting the 2,4 kb genomic subclone 
pteemid with BamHI and Smal. The fragment of about 800 bp was run in a 0,8% agarose gel 
and isolated from the gel with the Qiaguiek gel extraction kit (Qiagen) with manufacturer's 
Instructions. The probe was labeiied with ^P-dGTP with the Random Primed DNA labelling 
kit (Boehringer Mannheim). The filters were hybridized at 42*0 over night In a hybridization 
mixture containing 50% formamid ; 5x Denhardfs, 5x SSPE, 0,1% SDS, 100 ug/ml herring 
sperm DNA and 1 pg/mi poiyA-DNA (SSPE is 0,18 U NaCI, 1mM EDTA, 10 mM NaH s P0 4! 
pH ?,?), The filters were washed for 10 minutes at room temperature with 2x380, 0.1 % SDS 
and for 30 minutes at 85*0 in Q.txSSC. 0,1% SDS, A-clones giving a hybridization signal 
were converted into pBlueseript piasmids by fn two-excision as instructed (Stratagene), The 
7. meseJ IRE 1 cDNA was sequenced from one of the piasmids with internal oligonucleotide 
primers, 
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The area sequenced from the T, remer iiWIgem is about 4.5 kb, and the open 
reading frame encodes a protein of 1233 amino acids (Figure 27), Comparison of the 
genomic and cDNA sequences repeated one iniron. The Z rms&i SRE1 protein starts with a 
predicted signal sequence of 25 amino acids* There is & putative transmembrane segment at 
positions 574-598 of the open reading frame. The N-tenTiina! domain (before the 
transmembrane segment) presumably facing the lumen of the endoplasmic reticulum has 
24% identify and 89% similarity over an area of 377 amino acids with yeast IRE1 p. The C- 
terminal part with the kinase and RNAse -domains is 42% identical and 59% similar over an 
area of 490 amino acids to yeast iRElp, The cloned A nlduiam ireA f ragment is 1 570 hp in 
length (Figure 28). if encodes the kinase and RNAse domains of the IREA protein. Based in 
comparison with the yeast and 7*, rease/!RE1 sequences, an intron is identified in the 
sequence of the ireA fragment The deduced. A nkiuians IREA amino acid sequence lids 
52% identity over an area of 507 amino acids to the 7. rees&i iREl protein. 

Example 12 

Cloning and constitutive expression of the Aspergillus niger v&r. 0W8m&riMcA ©DMA 

The A nigervar, awamori hacA cDNA was Isolated by heterologous hybridisation 
with the cloned Aspergillus nktutms hacA fragment described In Example 3, A cDNA library 
constructed from A, niger var. awamori RNA in the plasmsd pYE82 (Invltrogen) was plated as 
£ coil colonies, lifted onto nitrocellulose filters and screened by colony hybridisation as 
described for the isolation of the 7* maseihacl cDNA in Example 3. The hybridisation and 
the final washes were performed at 57° C. Positive colonies were found and examined by 
restriction analysis and sequencing of the cDNA ends. The longest cDNA was sequenced 
throughout its length from both strands, it Is 1.68 kb long and encodes a protein of 342 
amino acids (Figure 28), The encoded protein has 78% Identity with A nitiuians HACA 
protein and 38% identify with T. te&s&i HACI protein. The A, niger v&r, awamori hacA cDNA 
has an upstream open reading frame encoding 44 amino acids. The region of the cDNA thai, 
according to homology with the 7. reese/and A niduiam had/A genes, had a 20 bp intron 
was sequenced from live of the A nigervat awamori hacA cDNA clones isolated. One of 
these clones did not have the 20 bp in&ron presents showing that the intron can be spliced 
out as is shewn in Example 4 tor the 20 bp introns of T. 'imsei had and A. nkiuians hacA 
genes. 

The UPR-lnduced form of the A org er van awamori hacA cDNA was expressed in A. 
nfgerv&r. awamori strains producing 'Tmm&tes mrsic0iorhcca.se or bovine preprochymosin 
which were constructed in the following 'manner.- Strains MPS and &AP4 {described In 
Berka, RM et ai„ 1990, Gene 88:183-182) are equivalent strains which are deleted for the 
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pep/l gene (encoding the major -®(*^ceiKilaff^rfic.p^^n^) and which have a pyrGmi! 
mutation. 

Strain &AP3 was transformed with pUCpj*QBS3 fc Create strain 
^AP3pUCpy»<3RQ3#1 1 whfeh produces bovine preprochyrnosin, This strain secretes and 
accumulates active chympsin (an aspartie proteinase) in the culture medium. The piasmid, 
pUCpyrGRGS, consists of the GR63 expression cassette (encoding the Aspergillus nlger 
gs'aA promoter, preprochymoslri open reading frame and gtaA terminator) obtained from 
pGRG3 (Ceilan, D. et at, 1987, Bio/Technology 5:389-376) and the Neurospara cmssa pyr4 
gene inserted into pUC19. Transformants of strain AAP3 with this piasmid ware selected cn 
the basis of uridine auxotrophy. Transformants were screened in liquid culture for chymosin 
production and strain AAP3pUCpy/GR(s3#1 1 was chosen as the best producer. 

Strain &AP4 was transformed with pGFT-iXCI to create strain MP4:pGPTIacease 
which secretes Tmm&t&s versicolor iaccase 1. The piasmid, pGPT-LCCI, is a derivative of 
piasmid pGPTpyrG 7 (described in Berks, BM. and Sameti C.C., 1989, Bioteehnoi, Adv. 
7:127-154) which contains the N. cmssa pyr4 gene as fungal selectable marker and the A. 
ntgwgteA prompter and A ntgerv&t. awamorigiaA terminator region separated by cloning 
sites. To Create pGPT-tOCl the open reading frame for the Tmm&tes mr&icoior hc1 cDNA 
{Qng, E, et ai, 1 997, Gene 1 96:1 13-1 19) was inserted between the giaA promoter and 
terminator regions in pGPTpyrGt. Transformants of strain &AP4 with this piasmid were 
selected on the basis of uridine auxotrophy. Transform-ants were screened in liquid culture 
for iaccase production and strain MP4:pGPTiaecase was chosen as the best producer. 

For the over expression of bacA, the induced form of the ,4, nigervm. awamori hacA 
cDNA was first created by deleting the 20 bp intron and truncating the 5' flanking region by 
about 1 50 bp, which omitted the upstream open reading frame. This was done by methods 
known In the art The resulting hacA gene fragment was then cloned into an A, nigermt, 
a wamori expression vector with methods known in the art. in the final expression construct, 
pMS1S2 (Figure 29), the hacA gene fragment is between the A. roper var. awamorigiaA 
(giucoamylase gene) promoter and terminator. The A niduiam a/ndSgene endodlng 
acetamldase was in the piasmid as a selection marker for fungal transformation. 

The haeA overexpression censtruct {pMSl52) was transformed into either A, ntgw 
var, awamori strain MP3pUGpyK3RG3#11 or strain MP4;pGPTiacease, The 
transforations were perfamted- aades&lbed in ;p«n88a at ah, 1987, Gene 61, 15S-1S4. 
The transformants were selected for the ability to grow on aeefarnide as the sole nitrogen 
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source. Transformants were passaged fires times on selective medium before they were 
sporuiated and single spores were plated on the selective medium. 

For Southern analysis the purified transtormants and the parental strains were grown 
in shake flasks (28°C, 200 rpm) in Clofltie specia! medium (described in WO 98/31 821 }. 
Mycelial samples fortotal-DNA isolations were collected on the third cultivation day. The 
isolations were done with the DMA EASY kit (fnvftrogen) according to the manufacturer's 
instructions, 5ug of the total DMA was cut with restriction enzyme HMW and Xhoi to obtain a 
5,2 kb-fragment from the integrated ptv1S152 to indicate which transformants have the haeA 
overexpressfon cassette and samples were run in i% agarose gel in IxTBE-buffer, The 
treatment of the gels and capillary blotting onto a Hybond-N nylon filter (Ameraham) were 
done as Instructed by the manufacturer* A fragment of the A. nigervzr. mvamonhmA cDNA 
labeled as described In Example 3 was used as a probe in the Southern hybridisation. The 
filters were hybridised at 42*C over night in a hybridisation mixture containing 50% 
formamide, SxDenhart's, SxSSPE, 0,1% SOS, 100 pg/mi herring sperm DNA and Ipg/rnl 
poly (A)-DNA, Filters were washed as described in Example 4, A band of the expected size 
was obtained from all the transformants thai were analysed, but not from the parental strains. 
This indicated that the obtained transformants were stable and that they contained intact 
«ao4 ovarexprassion cassette, 

Eight transformants from the Saecase-producing strain and four transformants f rpm 
the chymosin-producing strain shown to contain the fmcA overexpression cassette ware 
cultivated again for Northern analysis and measurement of the enzymatic activities. The 
pfv'ISi 52 transformants of the strain producing preprochymosin and the uofransformad 
parental strain (AAP3puepyrt3RG3#11) were cultivated in Clofins special medium in shake 
flasks (28*0, 200 rpm) in two parallel cultures for six days. Mycelial samples for RNA 
isolations were taken on the third day of the cultivation. The pMS152 transformants of the 
strain producing Trametes lacease and the untransformed parental strain 
(&AP4.pGPT!accase} wars cultivated in 8 g/lifre Bacto Soytone (Dlfco), 12 g/iitre Tryptone 
peptone (Dlfco), 15 goitre (NH^SO^ 12.1 g/lftre NaH 2 PO,.H 2 O and 3,3 g/iitre 
Na?MP0 4 JHiO, After auioclaving 5 ml/iitfe of 20% MgSO« solution, 2 ml/litre of Gu/cltrafe 
solution (1 10 g/iitre citrate*H;A i$S : g/m& CuSO^SHaO), 1 mi/litre Twaen 80, 300 mi/litre 
50% maltose solution and 200 mlrlstre of 100 mg/lltre arginlne was added to the medium. The 
cultivations were done in shake flasks (28*0, 200 rpm) in two parallel cultures for ten days. 
The mycelial samples for RNA Isolations were taken on the second day of the cultivation. 
Total RNA's were isolated from all the mycelial samples using the TRIZOL reagent {Oibco- 
BRL) as Instructed by the manufacturer. RNA samples of 5 pg were treated with glyoxai and 
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.run in 1% agarose gel in 10 mM Na-phosphate buffer, pH 7.0, Northern hiof&igs and 
hybridizations were don© as described in Example 4. A fragment of the A nig&rv&t. awamori 
fracA cDNA labeled as described in Example 3 was used as a probe. An mRNA of the 
expected size from the hacA overaxpression cassette of about 1.6 kb was observed in ail the 
transformants studied in addition to tie band of about 1,7 kb that Is derived from the native 
hacA gene and that is also seen In the controls. This Indicates that the S -truncated and 
infroniess hacA coming from the overexpression cassette is expressed In the transformants. 

Example 13 

The effect of A, mger vm\mmmor{ hacA memxpm$$ion on heterologous protein 
production 

Samples from the culture supernatapis of the pMS1 52 transformants of the strain 
producing preprocnymosin and the untransformed parental strain (AAP3pUCpyrGR63#1 1} 
were taken on the fifth day of cultivation. The cbymosin production levels were measured 
with a miifc-ciottlng assay. The samples were diluted into buffer containing 10 g/!itre sodium 
acetate and 5 ml/litre 1M acetic add. 200 pi of the diluted sample was added to 5ml of buffer 
containing 55 g/BQO ml skim milk (Difco) at 30°C, The clotting of the milk was observed 
visually and the time that the clotting of the milk took was recorded and correlated to a 
known standard. All the four transformants produced 1.3 - 2.8 fold more chymosln than the 
parental strain (Figure 30). 

Samples from the culture supsrnatanfs of the pfvSS152 transformants of the strain 
producing Tmmetm iaccase and the untransformed parental strain (&AP4'.pGPTIacease) 
were taken on the fifth and seventh day of the cultivation. The iaccase activity measurements 
were made from the supernatants and the results showed that all the transformants produce 
more iaccase than the parental strain. Iaccase activity was measured according to Niky- 
Paavola at al. {Niku-Paavoia M-L, Karhunen E s Saloia P, Raunio V (1983) LignlnoSytie 
enzymes of the white-rot fungus Phiebia radiate. Bioohem, J. 254: 877-834} using A8TS 
(Boehringer Mannheim; Mannheim, Germany) as a substrate. The production levels of the 
transformants In the fifth day samples were 3 to 7,6 fold higher than in the parental strain, On 
the seventh day of cultivation the transformants produced 2 to 5.4 fold more iaccase than the 
parental strain (Figure 31), 

These results demonstrate that overeKpression of an inducing form of hacA enables 
production of higher levels of secreted heterologous proteins in A. niger. 

Although the foregoing invention has been described In some detail by way of 
illustration and example for purposes of clarify and understanding, it will be obvious that 
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1 . A method of increasing the secretion of a heterologous protein in a eukaryotlc 
coil comprising inducing an elevated unfolded protein response (UPR). 

2. The method of Claim 1 wherein Inducing is by increasing the presence of 
HAC1 protein in said cell, 

3. The method of Claim 2 wherein said HAC1 protein is consiJUifively produced. 

4. The method of Claim 2 wherein said increase of HAG1 protein is by a UPR 
inducing form of a HAC1 recombinant nucleic acid. 

5. The method of Claim 2 wherein said HAC1 protein Is encoded by a nucleic 
acid isolated from a eel! selected from the group consisting of Aspergillus, Trichoderma, 
Sacoharomyces, 3chizo$acehammyce$ f Ktuyvefomyces, Pichia, Hansenula, Fusaritm, 
Neurospora, and Penitiiiium. 

8. The method of Claim 2 wherein said HAG1 protein is encoded by a nucleic 
acid isolated from yeast, 

7. The method of Claim 8 wherein said yeast is Saccharomyoes cemvlsia®. 

8. The method of Claim 2 wherein said HAC1 protein is encoded by a nucleic 
acid isolated from filamentous fungi, 

9. The method of Claim 8 wherein said fungi Is from Ttfchoderma, 

1 0. The method of Claim 8 wherein said fungi is Trichod&rma rease/, 

1 1 . The method of Claim 8 wherein said fungi is from Aspergillus* 

1 2. The method of Claim 8 wherein said fungi is Aspergillus nkluims. 
i 3, The method of Claim 8 wherein said fungi is. Aspergillus nlgm 
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14, The method of Claim 1 wherein said Inducing is by modulating the level of 
IRE1 protein or PTC2 protein in said celi. 

1 5, The method of Claim 1 wherein said Inducing is by increasing the level of 
IRE1 protein, 

1 8. The method of Claim 1 5 wherein said iREt is an IRE! variant which has the 
activity of a constitutfveiy phosphofylated IRE1. 

17. The method of Claim IS wherein said SRE1 protein is encoded by a nucleic 
acid isolated from a celi selected from Jfte group consisting of Aspergillus, Trthaderim, 
SaacMromyces, $ehizosaccharomyce$> Kiuyveromyces, Pichia, Hans&nuia, Fusamm, 
Neuro&pma.. and p&mcilliam. 

1 8. The method of Claim 15 wherein said (RE1 protein is encoded by a nucleic 
acid isolated from yeast. 

1 9 . The method of Claim 1 8 wherein said yeast is Saccharomyces cerevmaa, 

20. The method of Ciaim 1 5 wherein said IRE1 is isolated from filamentous fungi. 

21 . The method of Claim 20 wherein said fungi is from Tnohod&nm. 

22. The method of Ciaim 20 wherein said fungi is Tilchodmma mes&i. 

23. The method of Claim 20 wherein said fungi is from Aspergillus. 

24. The method of Ciaim 20. wherein said fungi is Aspergillus ntdufam. 

25. The method of Ciaim 20 wherein said fungi is Aspergillus niger. 

28. The method of Claim 1 wherein said ceil is selected from the group consisting 
of Aspergillus, Tnchoderma : Saccharomyces, Schimsacchammyces, Kkiywrwnyces, 
Pichia, Haneenula, Fusanum, Neumspom, and Penmillium> 



27, The method of Ciaim 1 wherein said ceil is a yeast celi. 
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28. The method of Claim 27 wherein said yeast Is Sacchammyoes cemvlslae. 

29. The method of Claim 1 wherein said cell is from filamentous fungi, 

30. The method of Claim 2§ wherein said fungi is from Triehodsmm. 

31 . The method of Claim £9 wherein said fungi is Tmhoderrm reeseL 

32. The method of Claim 20 wherein said fungi is from Aspergillus. 

33. The method of Claim 29 wherein said fungi Is Aspergillus ntduiam, 

34. The method of Claim 29 wherein said fungi is Aspergillus nlg&r, 

35. The method of Claim 1 Wherein said cell is an insect cell. 
38, The method of Claim 1 wherein said ceil is a mammalian cell. 

37. An Isolated nuoieic acid encoding a HAC1 protein, wherein said HAC1 protein 
induces unfolded protein response and has less than 50% similarity to yeast HACT protein, 

38. An isolated nueieie acid encoding a HAC1 protein, wherein said HAC1 protein 
induces unfolded protein response and wherein said HAC1 protein comprises a DNA binding 
region that has greater man 70% similarity to the DMA binding region of filamentous fungi 
HAC1 protein, 

39. The nucleic acid of Claim 38 wherein said filamentous fungi HAC1 protein has 
an amino acid sequence as shown in Figure 7 or Figure 8 or Figure 28. 

40. The nucleic acid of Claim 38 wherein said HAC ! protein has an amino acid 
sequence having greater than 70% similarity to the sequence of Figure 7 or Figure 8 or 
Figure 28. 

4 1 . The nucleic acid of Claim 38 isolated from Tmfiati&mia reeseL 

42. The nucleic acid of Claim 38 isolated from Aspergillus nidulam. 
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The nucleic acid of Claim 38 isolated from Aspergillus nig&r> 



44, The nucleic acid of Claim 38 wherein said HACl protein has an amino acid 
sequence as set forth in Figure 7. 

45, The nucleic acid of Claim 38 wherein said HACl protein has an amino acid 
sequence as sal forth in Figure 8. 

48. The nucleic acid of Claim 38 wherein said HACl protein has an amino acid 
sequence as set forth in Figure 28. 

47, The nucleic acid of Claim 38 wherein said nucleic acid comprises a coding 
nucleic acid sequence as set forth in Figure ?. 

48, The nucleic acid of Claim 38 wherein said nucleic acid consists essentially of 
a coding nucleic acid sequence as set forth Jr* Figure 7, 

49, The nucleic acid of Claim 38 wherein said nucleic acid comprises a coding 
nucleic acid sequence as set forth in Figure 8. 

50, The nucleic acid of Claim 38 wherein said nucleic acid consists essentially of 
a coding nucleic acid sequence as set forth in Figu re 8. 

51 , The nucleic acid of Claim 38 wherein said nucleic acid comprises a coding 
nucleic acid sequence as set forth in Figure 28. 

52, The nucleic acid of Claim 38 wherein said nucleic acid consists essentialiy of 
a coding nucleic acid sequence as setforth in Figure 28. 

53, A protein encoded hy the nueiefc acid of Claim 37 or Claim 38. 

54, A protein having unfolded protein response Inducing activity and having 
greater than 70% similarity to an amino acid sequence of Figure 7 or Figure 3 or Figure 28. 

55, A protein having an amino acid sequence as set forth in Figure 7 or Figure 8 
or Figure 28. 
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58. An isolated nucleic acid encoding a PTC2 protein wherein said PTC2 protein 
modulates unfolded protein response and whe rei n said PTC2 has at least 70% similarity to 
an amino acid sequence of Figure 24 or Figure 25, 

67. The nucleic acid of Claim 50 Isolated from Tmhoderms reesei, 

58, The nucleic acid of Claim 58 isolated from Aspergillus oiduians. 

59, The nucleic acid of Claim 55 isolated from Aspergillus nigm\ 

60, The nucleic acid of Claim 58 wherein said PTC2 protein has an amino acid 
sequence as set forth In Figure 24, 

61 , The nucleic acid of Claim 55 wherein said PTG2 protein has an amino acid 
sequence as set forth in Figure 2S. 

82, The nucleic acid of Claim 56 wherein said nucleic acid comprises a coding 
nucleic acid sequence as set forth in Figure 24. 

63, The nucleic acid of Claim 58 wherein said nucleic acid consists essentially of 
a coding nucleic acid sequence as set forth in Figure 24. 

84. The nucleic acid of Claim 58 wherein said nucleic acid comprises a coding 
nucleic acid sequence as set forth in Figure 26, 

85. A protein encoded by the nucleic acid of Claim 58. 

88. A protein having unfolded "protein response modulating activity and having 
greater than 70% similarity to the amino acid sequence of Figure 24 or Figure 25. 

57. A protein having an amino acid sequence as set forth in Figure 24 or Figure 

25. 

68. A nucleic acid encoding a IRE1 protein having unfolded protein response 
modulating activity and having at least 80% to an amino acid sequence as shown in Figure 
26 or Figure 27, 
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69. The nucleic acid of Claim 88 wherein said SREi protein has an amino acid 
sequence as shown in Figure 26 or Figure 27. 

70. The nucleic acid of Claim 88 wherein said nucleic acid is isolated from 
Trfobod&rma meseL 

71 . The nucleic acid of Claim 66 wherein said nucleic acid is isolated from 
Aspergillus niduiatw, 

72. The nucleic acid of Claim 68 wherein said nucleic add is isolated from 
Aspergillus niger. 

73. The nucleic acid of Claim 88 wherein said 1RE1 protein has an amino acid 
sequence as set forth in Figure 26. 

74. The nucleic acid of Claim 68 wherein said IRE1 protein has an amino acid 
sequence as set forth in Figure 27. 

75. The nucleic acid of Claim 68 wherein said nucleic acid comprises a coding 
nucleic add sequence as set forth in Figure 26, 

76. The nucleic acid of Claim 68 wherein said nucleic acid consists essentially of 
a coding nucleic acid sequence as set forth In Figure 28. 

77. The nucleic acid of Claim 88 wherein said nucleic acid comprises a ceding 
nucleic acid sequence as set forth in Figure 27, 

78. The nucleic add of Claim 68 wherein said nucleic add consists essentially of 
a coding nucleic acid sequence as set forth in Figure 27. 

79. A protein encoded by the nucleic acid of Claim 88. 

80. A protein having unfolded protein response Inducing activity and having 
greater than 70% similarity to trie amino add sequence of Figure 26 or Figure 27. 

81. The protein of Claim 80 wherein said protein has constitutive unfolded protein response 
inducing activity; 
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82. A protein having an amino acid sequence as set forth in Figure 26 or Figure 

27. 

33. A cell containing a heterologous n«e!e§a ; aefd encoding a protein having 
unfolded protein response modulating activity and a heterologous nucleic acid encoding a 
protein of interest to be secreted. 

84. The cell of Claim 83 wherein said protein having unfolded protein response 
modulating activity is selected from the proteins selected from the group consisting of HAC1 , 
PTC2 and 1RE1. 

35, The cell of Claim .83 wherein said protein of interest is selected from the group 
consisting of lipase, ceilusase, endo-glucosidase N, protease, carbohydrase, reductase, 
oxidase, Isomarase, transferase, kinase, phosphatase, aipba-amylase, gluooamylase, 
iignoeeliufose hemiceilulase, pectSnase and flgninase. 
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T « tees ei immQSSSLW^EASVmBWhShmm^TShFhmmSTl^mmTmB 5 o 

A, n.idulans mSM&FS$VKHEP&-- - - FANSLPT?PSI»BWV1»WS 34 

* ** * , ****„* , • * 

T. r&es&i VAmDSRLSVXPESQOASSDgSHS^SAI^ 3.00 

A. nidtilMilS PADTSXRTKHWAQTRFE<-- — >— ■- •• EKKPAXKRKSWGQEIjF V 69 

** * a * **** ******** ** 

T, rees&i PKT&^PRJCRJUCTSS^ 150 

Yeast *KSTLPPRK?AKTKEEKBQRRlERlL?^RRA^QS5lEKKRr*KLQVI < ERKC 71 

* e ********* fr ****** < -** <( **** ** *** t *** . ^ ** 

t, rmmi mhETLhxmQW m^mmhmwmss^mBSB^mshQmt^sQQ 2 00 

A. »i<&l2aiJS XDKEQQN- - -••OPt^QRLAQ*fE&ER^RX J SOQVAQLSM:VRGSRHSTRTSSS XSS 

Yeast SLLBHLr^SVKLEK-»X,ADHE * * *. *. .. * * 

* * 

Ts re&s&i LFGSRDGQTMSNPSQ3^DQXHRS^I? , rVH?ASLSPSLPPXSE)KSFQTK 250 
A* MdUslms PASVSPTLTPTLPKOBGDEVPIiDRXPFPTPSVTDYSPTLKPSSIjA.E — ~~ 2X2 

T. st&es&i EEDEEQADEDEEMEQThmTKBAAkA^^ 300 
A. $idul$ns — . -~-$mL?QHmV'SVG® 226: 

f, reesei IXiLWPVFSDDAGAItfCLGI^^^ 350 

A , ni&ulsim LBGDSSALTL- FBIiGASXIOiEPX^HDIfTAFLSDDDFRRI.-FSGDSS'IiBSD 273 

** * « * * *** 

. A. nlcfttlaji* SSLLEDGFAFDV LDSGDLSAFPFDSm^DFDTEPWBEBLBQTMGBS 3X9 

„ * ** . * * *** x * 

'P. reesei 7AASR^A,^X!RSLDLEIRDPEWQXPSRHSXQQPQSGASSHGCDDGGlA\fGV 451 

A , ntdulaxis DSASCKAASL- » ~ — - — — - - --•QPSBGASXSRCDSQGXAAGSA 350 

e ** ** 4 ** ***„ ** **t* * 
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1 TTTGAAC&GCAGATCGTTACTGCX^ 

F SQQIVTAY g DVT V HELTED 
61 GATGAATTCTTAGT&ATC^ 

DBF L V I A C D G 
12 1 eac&atetgafcatactaeag<m&rc^S^ 

i W D C Q S S 0 A V V S F 
1 8 1 CGTTCC^C<X:e<mTCSC(^^ 

V R R G I A A K Q D I. Y R I C E H M M D 
241 CAACTGTCTCGC^TCCAACAGT^AGACTGGI'GGAGTTGGC'TfSiGACAACATGACAATGG'T 

&CL ASMSETG G VGCDMMTMV 
302 CATTAmGGTCTCGT^^ 

I X Q h L JST G K 3 1 K £ E W Y H Q X A E R 
361 GGTTeCTAACGC^GAGGGCCCTTGTGCTCCGCCCGAATACGGCAAGTCTCTCGAGGAAGC 

V A M G D G P C A P W B Y S K S L E S P 
42 1 CACGGCCTGCMTCCCmCm^^^ 

'T A $ K * Y * ■ ^ " . 

541 GCXCATTCAACGTGeGTTGTG^ 
601 C&bGSyilUlCAGA&TGACGAGGA^ 

86 1 AAGTGCAACGCCAGAATACCGACACAGAA^GAA^TGACCXiTGAAGGGACGCCTGGGCC^ 
? 2 1 AATCCGCGGCTCGCCAGACGAACACG^CCGCTl'CGGATGGCTCAGAGCCTTCTAAGACAC 
782 &K3*6&AAC^ 

841 G&SIX&BG&hMZGM ■ 
9.01 TGCCGAAGCCTCAGAAT'rCTGCCCCGATACGGTCGCTCATTGATTTCCTGTTTCTCAGGA 
9 8 1 CTTG AAGGCGC ATTGG'TGCTTGTSACCGCGAAGAfGCGAAAGAGACGGAeGAmTCATCC 
102 i CCTTCTATCTCTTOTTTTAATGCCATCTTCTTACTTTTTACGAGCTCATCCAGATCAAA'T 
1081 C ACeTTCGTGTTACTeeAGGATGSA^^^ 
1141 T&CT$T l QQi!ffl 
12 0 1 &&kG1?&CQT^ 
1261 TAfAA 
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61 TCAGCCAeGS^TCG?^ 
121 CGCGtPCCCTOAGTACCmAl^^ 
1 81 CCCTCCTCGACGCTTT^G^^ 
241 CCATC^GCCmACCCTCa^^ 

M 6 Q T L S E P V V E K . T S E K G E D 
301 ACAG&CTCATCTAC^GGTGSG^^ 

3D R L I Y G V S A M Q G W R I S M S D A 
361 ACACGGCrGAQCTClAATCTCCCCC^ 

H T A E h M L P P P S "M D T K T H P D R 
42 1 T0TCC£TTX ? TCGGAGTCW 

L S F F G V F D G H G G D K V A L F A G 
4 S 1 AGA^CATTCACAACATIXSl^ 

E H 1 H N I V F K Q E 3 F K S G D Y A Q 
541 G^CTCAAGGACGSC^ 

G L K. D G F t A T D R A 1 L N D P K Y S 

B E V S G C T A C V T L I A G 8 K L Y V 
©61 CCAACGCCGGTGAT13CTCGAAGCGTGCTGGGCATCAAGGGACGGGCCAAACCCCTA?rGCA 

A J3 A G D S R S V L G I K G R A K 3? X* S 
721 ACG&CCAC^GCC^&GC^ 

KD HKFQLBTEKNRI TAAG G F 
783. TCGACT^TGGCCGAG'rc 

V D F G R V N G W h A I« S R A X G D F E 
841 1?CAAGAAGAGCGCCGAGCTGTCCCCCGAAAftCCAGATCGTT^^ 

F K. K S A B L S P E M & I V T A F P B V 
901 AGGTGCACGAGCTTACAGAGGAGGACGAGl^CC^GGTGAOTGCCTGTGAGGGTATGTGGG 

E ¥ H E h .f E E 15 B F L V I A C D G I W 
961 ATTGCK^IKS^^ 

D C ft S S Q A V V B F V R R G X A A E Q 
1 0.2 1 AOCWWSi^ 

D 3L D K 1 C E H M M D M C L A S M S B 'U 
1081 GTCSCQTCGGCTGCGAC^^ 

G G V G C D 8 M T K V I I G F L H G K T 
1141 AGGAGGAGTGGlATGACGAAATTGCCAAGAGAGTGGCCAilCGGAGACGGCCCCTGTGCCC 

REE W Y D E I A K R V A N G D G P C A 
1201 CCCCGG^TATGCGGAGTTCCGCGGTGCCGGGGTTCACCACA^CTACGAAGACAGCGACA 

P P E Y A E F R G F G V K R M Y E D S D 
1261 GCGGCT ft CG ACGTCG AC GCCGAC AGCGGCGGC AAGTTT AGC GTTGC CGG&TCCCGGGGTC 

S G Y D ¥ D A D S G G K F S L A G S R G 
1321 GCATCATCTTCCTGGGCGACGGCACCGAAGl'CC-TGACXiGG^rrCCGACGACACGGAGATGT 

R I 2 F L G D G T E V L T G S D D T E M 
1 3 B 1 TTOACAATCK^?GACGAGGACAAGG^ 

F D S A D E D K D t A S Q ¥ F R S S G K 
1441 CCGATGCAAASGAGGAG&C&GAGGC^AAOC^ 

T D A K E E T E A K P A P E A E S B K P 
1501 CGGATGGGTCGGA<2AA<^<3£^ 

A B G S E K K 8 .33 B K T P E E S K K D * 
1561 TGGX'CCTCTTG&ATTCT^TGGGC^CG^ 
1621 CGl'G^GTTTOTGfT^Ad-GTGTG^AT 
1 6 8 1 AACCCCAGGCGTG&S83G^^ 
1741 TCTGGAAeGAftACAl^PC^ 
1801 TGl^TGCTCGl^AAAAAAAAAAAAAAMiAAA 
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6 Q K S H R B & K K K' X B S S K B E S D 
TCAO^CCCTGQC&C^^ 1 2 0 

H A I? G T L Q F P A G P 15 A Q L A L T R 
CACTGCATGTAA^GAGGTG^Tmi^^ 180 

T A S St S V F S A D G V I Q X G R L K V 
CYTTACGGCTGACGTTCTGGGl'CATGtSA^ 240 

F T A D V L G H & S H G T V V Y K G S F 
^MO^Ci^KCaTCGesm^ 3 0 Q 

D G a D V A V K R M L V F F Y » X A S K 
CGAAQTGGG&TTGf'''^ 360 

8 V G L L Q B S 0 C H B S V I E C ¥ C R 
imGCAAGCCAAGGGT>Fi?€YTCT^^^ 420 

K Q A K : G F F Y X A L E L C P A 3 L Q D 
TOTGtSTAGAAC^CCAG&CGCGS^ 480 

V V E H S D A F P Q G ¥■ M G G L D H P D 
GS1?C3TOCGTCRMTW^ 540 

V L E Q I V A G V R V L K S L K X V H R 
WBHSPfGAAGOrrc^ 600 

D L K P Q N I Xj V A A F K GRIGS R A 
CATGCGGCT'rCIWOTCGOACYm^ S60 

X R L L X S D F a&CKKLBDNQS S 
ATTCAGGGCAACCACGGCCCATG^lXaCl , GGTACTCCGGGTGGAGGGCTCCCGAACTGCTl > 720 

F R A 1! T A H A A G f P G G G & P H C L 
(SSfim^MmCAAGAG^^ 780 

W H Y T R A G ?5 Q G S F S Q ft T E S S E 
GCCG^CGGTCGfCGATCCCCAGACGAAX^GACGAGCCACCCGAGCCATTGATATCfTCfC 840 

p a v v r> p q i? n a r a t r a x £> i f b 

CeTGGGAfGTGTCTTGTACTACG^ 300 

X» G C ¥ F Y Y V L T E G C H P F D K H G 
l»»S*k&l!GC!B^ 960 

K F M R FAN! V K Q ® F N' X* D S L Q R 
.TdT&IGJIS^ 1030 

X) G E Y A F E A D D L X R S M it A L D P 
ACGTCAACGgfcatgtcccaacaacatcttcctttgccttgtggcgtagcgfcact^atctc 108Q 

R Q R 

cacagCCCCGACGCAAGCGCTG^TTAACCCA^CCTTTCTTCTGGAATCGG'rCCGACCGC 1140 

P D A S A V L T B P F F W N F S D K 
CTTAaGTTCCTCTGTGACGTTTCGGAGCACTTCG AGTTCGAACCGAGAGA'ICC'PCC AYCT 1200 
L S F L C D V S D H F S F F. F R D P P S 
GACGCTCl^TG'I^TGTAGAGTCTGTAGCC^CTGATGTCAXTCGCCCTGAAATGAATCCT X 2 6 0 
D A L L G I< E S V A S » V I G P E M H P 
ClU&CTCCTGCCajUKS^ 1320 
Q 1? P A K. G L Q R Q S R K Q F K Y T G S 
AAAA1?GCTGGACTTGATGCGA*3GCC^GCGGAAGAAGCGCAAGCAGTACAATGATATGCCG 1380 
K M L B L M R A L R H E R H H Y K D M F 
GAGCATTX'GAAAGCTCATATOGGTGGGCTGeCGGAGGGt^TACTTG&MT'TCX'GGAC 1440 
E K L K A H X G G L P E G Y L H F T rf 3? V 
CGOTTCCCGAGXT YG 1500 
R F P S L L M S C H W V I V E L G L OP K 
ACGGATCGGTYCCAAGAGATACTX^ACGCCA^'FGGAGTAGGTTGTTGCG'TACTGGTTCAG 1 S 6 0 
Y 3D R F Q E X F Y A X G V G C C V I; V Q 
AAAYAYATYG 
KYI 
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1 GCACGAGCAAGATACGGCCTCTCSe&CCAAg^^ 

61 T<^SGGTGAA<3GG<3G(^^ 

121 AGCCCACGATCg^CCACAQCCTGCACCACOTTCTCCTCGTCATATTCi^ 

181 C&MSCGGTTTCCOTTGCCTTCGAj^ 

241 TCTAAAGCTAeTGCa'TTGGCTGCTGGGeG^ 

361 GC^2T^TCCTT<XOT^ 
421 C K IX:TACCTCTCe{3TT£^&^ 
481 TCTCCGGCACCCCGAC?3^G^^ 

H 

541 GGTCCGCGTCGeATCAGAAGeTCTCG^GC^^ 

V R V A S E A L L A P A F I L I P S? L Q 

'LADA0QQBQQPQXRXHS0KG 
661 CGACGCQCCCCTTQACAAAGTCGCCGACGATGCCA&CACCC 

D A P L X> K V ADD A N T R W Y A T K A 
721 f|N£^CCAG&C^^ 

APDVHPEA K F D T V H R K Q K Q 0 
781 GTCGACCGCTTCGCCCCAGCAACACCAGAAATATCGACGAGCCCCCTATGACTACGCCAG 

SThS&QQBQK 1 R K A P Y » Y A S 
841 C&AGi^CMGSCCC^^ 

K D K A 0 H R Y A Q H P 1 R E S £ K P S3 
9 0 1. CTACGTAAAAGTCCCC AACGATGCGAGCGCCCTCGC AACTTT AGCTGCGGCf CASCCCGT 

Y V K V £> N D A S A L> A T L A P A Q P V 
951 CCGAGCACCAC^C7iCCTCACGACATCACTGGCCCAGCAGCAGCGCCGC'rrGTGGGCTG<sC 

R A P H T S R H H W B S S S A A. S G L A 
1023- CTC^SCACAATGCGC<^AQTCT(m 

s p h ® a r s i» e 3d w b v s d f v l- l a 
1081 gaecgtcgatggagacctctatgccagcgaccm^ 

t ■ v £ g d l y a s d r r t g r h l vi h £ 
1141 cgaggtcgaccagccagtggttgaaaccaaacactaccgaacaaagaactccg'j'cc'i'cga 

e v d q ? v v b t k k y r t n n s v l d 
1201 cgacgactatcggcccgtcgaccactacatctgggccgtcgagccgagccgcg^tggagg 

d d y r p v b h y i v«i a v e p s r d g g 
1261 gcix:tatgtatggatccccgactccggagcgggcctcgtcaggaccggcttcaccatga^ 

L Y V W I P D S G A G L V R T G F T M K 
1321 GCACX^TCGTTGAAGAAC^GCTCCATACGCCGGCGACGAGCCCCCCG'TTGTCTATACCGG 

H L V E E L A P Y A G D E P P V V Y T G 
1381 AGACAAGAAGAGGACCATGGTCACCCTGGACGCC^TACCGGGCGCGa^CTCmATGGTT 

D K K T T M V T h D A A T G R V L K W F 
1441 TOOCTCTAGCGGKn^CMyCma^^ 

G S B G S Q V N E A E S C L R P N A F D 
1 S 0 1 CGACAGGGATACCACAGAGTGCAGCTCCAT^GGC&CAATCAC 

D R D T T E C S S M G T X T L G R *T E Y 
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1561 CACGGT?GGGGAXK?CAGAGGCGAGAGGG™GGC^ 

t v g i q n a d g a a i a t l k y a e w 

1 6 2 1 GGGACCGAACACCTX^TGACAGCG^CGTCTACC 

G P T F D S D L Y Q Q ¥ H A S L D 8 H 

1 6 8 1 'If ACATCAGC AGTC AGCACG ACGGGAeM.^mc&m , TimCAAGTC&CAGGCAGAAAA 

Y I T S Q B D G E I X A F D K S 0 A 1 N 
1741 e<^CCTGTCCC?CTACAtXXm^ 

D L P L Y T H E F S S P V A R V F D V C 
18 0 1 TCGACCGl'GGGA^GCGAATQCGGGA^ 

RPWDAMA GS M FBLVVLPQ P P 
1861- AAT'fGCAGCGCT^QATQAGAGCA 

I P A 1> D S 8 T V K M R S k S 1 F L S3 Q 
1921 GACTGAAAGCGGCGAGTGGTArGCGCTCTCGGGCGOT^ 

T B S G D W Y A h 8 G R A Y P E X & B A 
1981 CCCCGTGaCXCAG^TCTCGCGGGACGAC1?TG^GGGATATGaCCC ATGCCTm^ATTCCAT 

P V A Q X S R D D L W D M A H A F D S I 
2041 T&ACCCA^TA&SC^Gi'CCA^ 

h p k k h s k a l v g t k f i. n f v k s 
2101 caccggttaccatcagccck:cgacgctccctc<;cggcgccctcgacgagtattacgagga 

T G ¥ H 0; P P T l> P A 6 A h D E Y Y E D 
21.61 •COTGCAGAACGCOTCAAACAk^^ 

L E & A S H T-? A H A V T N T V P & E F T 
2221 CATCATCACGA^AGXCAAGGCTCTTCCGCAGAGTGCTGGG^ACAGCGTCAfTGAC^TG^ 

X I T K V K A £, P Q S A A N S V I f> F V 
2281 CAGCAACCCCArTCTCATCATTT^^ 

S H P X I, I X F L X G S L X Y H E K X E 
2341 GCGACGGTCGXATCATCGGl'O^^ 

R R S Y H & F R *? H G T X K D V Y F F F 
2491 -CGT»A1Ha^TC^3i&^ 

V X ESS A G D S S G D D K D G V F P S 
2461 •T5k2CK^6dSPCW 

S P B F R S Q P Q D Q 35F A £ D H L S R H 
2521 CAAGGTGGAGAGGAATGCCGGCGACCAGGACAAGGTCAAGGACA^CAGGAGCC^GG'ATGA 
XVERNAGDQ D R V KDNRStHO 
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2581 CGSOTCm^CCST^^ 

V S' B T 1 I .P B H K T V E X 2? A O V Y K 
2641 GCAMSTGGATGTAGCTGGCOCX^AC^ 

Q V D V A C P 0 A P S T D S H G A Pi P R 
2"?0I GAAGiyiGAAGMiGGeXCACCG&G^ 

K K K K. A H R G E R G G V K H R K G S P 
2761 CACCGACGGCXCGCAGXeXHXAXanA^^ 

T D G S Q S H 1 S D P A L T T V D E A V 
2821 iU^CJtftfCWGAAG^^ 

S M A K K L & B B P S I> B P D V M X X Y 
2881 C AACG ACAXGC AAGCCGTCACGGSCTCTCTTATCAGCATGGGMACATCGAGGTCGATAC 

R P M Q A V *3? G S V X S M Q K I F V D X 

2 S 4 1 GanTGXCGAGCTTGGCATGGGeAGC^JSGGXACT^^ 

D V E I> G M G S M G T V V P A G R F D G 
300 1 CAGGGAOGXCGCC^TCAAGAGAAXGACGAXTCAGG^ 

R D V A V K P M T I Q P Y O I A *S R E T 
3061 TARGTTGCTGCGCG AGAG'IXs ACGAGC ACCCCAATg t a&s t c a gc c c t ca t c gt t. fcc a C c c 

K L L R S S D D K F Jl 
3121 attttcGCttcgcr.aacgtaaccactgtctgcacGl'CATTCGGTA'I^ACTCACAAGTGCA 

V 1 R Y Y S Q V Q 

33.81 GCGAGGCGACTX^CTGXATATTGCCTl'GGAA 

R G » F X Y X A L E R C A A S L A D V I 
3241 TOAA&AGCCGTAX^em^l^^ 

B K P Y A F G 35 h A K A G Q K X> X, P G V ' 
3301 C^TGmeCAAATCACCAACGGCM^ 

L Y Q X T Sf G X S H X. H S :L £ X V : R E D 
3361 CfTCAkGCCTCAAAAaV^ 

X K P Q H I 1. V H L D K D 6 R P R £> X» V 
3421 GKGGACXCTGC^'C'&GTGTAAGAAACTGGAGGATAGACXvGTCTTCGTXCGGAGGAAGGAG 

$ S> F G L C £ K L S £ R Q S S F G A X* 
34 81 AGGCCGAGCCGCTGGAx^CGO'CGGGATGGCGX'GGCCCCGAACTGCTTCTCSATGACGACGQ 

G R A A G T S G W R A R E X I< L D -0 •© G 

3 5 4 1 .ACAOAAXCCCGCAGCC AXCGAXAGG^GX'ACGG AC AGGGGCTCTCACACCATCCTCGXGGG 

Q ^ P A A I "D S S 3* H S G S R T I & V G 
3 g 0 1 AGACCCCAACX'CGCTTTCCAAXGG AGGGCGAGCCACGAGGGCCATTG ACATCXTC'TCCCX' 

D P H S !»: S M G 0 R A X S A X D X F S Z* 
3661 0?GGCCX'TGTCTOCXl'C'TACGX^CXCAeCAATGGATCCCACCCGTX^GA 

G L V F F Y ¥ X* T G S H P F S C G D P 

3721 AXATATGCGGGAGGTGA&CATXCGAAAGOGCAAC 

V M R. K V N X R S Q M Y ISf 1* 0 F L D A L 
3781 GGGCG ACXXTGCCTACG A^GCCAa.GGATCTG ATTGCGTCCx^XGC'TCC AGGGCXCXCCC Ak 

G D F A Y E A K D I* X A S M L Q A S P R 
3841 GGCAC6ACCX^ACTOGC{^GAG<^fc^ 

A R P P S R E V M A H P F F W S R K K P 
3901 TCaWCCXXTi^GXGCGACGXGXCGCAX'TCTCTGGAGA^GGAGG'I^CGAGATCCTCCGX'C 

L A F X C D V S C S ts E K E Y R D P PS 
3961 GGCXGCCX'TGGTCGAGCTGGAGCGACATGCGCCGGAGGTCA'PXAAGGGAGAC'I!XCXTGAA 

R A L V S i, E R H A P E V I K G t> F L E 
4021 GGlKlCTCACGCGCGAC'I'XTG't^GAaa^^'^GGGCAAGCAGCGC^ 

V X X R D F V E S Is G K Q R K Y X G 8 K 

L !> D L X R A % E H K R t<f H Y E » U S D 
4 14 1 ClICGCTGAAGCGCAGi^i^Q^EC^C^ 

S X< k R S V ■« 3 E P D G Y L A Y W X V K 
4201 GTTCCCGATGCXGXTGCTGACGTGGX'GGmGGXGGTGXAXAATCTCG 

F P M L L X T C W IS V ¥ i' L E W R K X 

4261 GGATCGGXXCAGGGAGrAGmTGAGCC'IGCGGGAXXGXAGAaGAAAG 

D R F R E Y Y R P A G L * 
4321 AAGAAAGGCei\:XXGenX^^ 

4301 AAATAX^IGGGGAAGOTGCAXGGGAAG^GAACAAAAGAGGGGAAA 
4441 aa^GTCeGTTAGCOG^GS^ 
45 GX CATCAAAAGCGT?GTGTTT3!C©ISl?GT^T 
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